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Foreword 


This issue marks the 40th anniversary of the Bell System Technical 
Journal, which has been published continuously since July, 1922. 
Through these 40 years the B.S.T.J. has had one basic purpose: to be a 
journal of definitive technical papers that help to record the progress of 
Bell System communications. The B.S8.T.J. is thus an important factor 
in carrying out the Bell System policy of prompt publication of new 
research, development and systems engineering knowledge. 

Although the pace of change has increased in recent decades, B.S.T.J. 
papers from the beginning have dealt with many themes that are still of 
fundamental importance to communications research and technology. 
Among the important early contributions, for example, were articles by 
Harvey Fletcher on the nature of speech, by R. V. L. Hartley and T. C. 
Fry on binaural hearing, by George A. Campbell on wave filters, and by 
EK. C. Molina on traffic theory. Early articles of significance to telephone 
transmission included those on carrier transmission by Hartley, long 
cable circuits by A. B. Clark, and radio communications by Lloyd Espen- 
schied. K. K. Darrow published the first of a notable series of articles 
reporting advances in physics, and among the first articles in the area 
of materials research, G. W. Elmen and H. D. Arnold reported on 
Permalloy. 

These early contributions helped to establish standards of technical 
excellence that have been a challenge and inspiration to subsequent 
authors and to those who have served as editors and advisors. As new 
knowledge about communications advances even farther in scope and 
depth, the B.S.T.J. will continue to serve both the Bell System and the 
entire scientific and engineering community. 
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Automatic Trouble Diagnosis of 
Complex Logic Circuits 


by S. H. TSIANG and W. ULRICH 
(Manuscript received November 2, 1961) 


This paper deals with the problem of maintaining the most complex 
portion of an experimental electronic telephone switching system, the cen- 
tral control. New and more effective automatic trouble detection and diag- 
nostic techniques were used. In order to utilize these techniques effectively, 
a maintenance dictionary, 1.e., a table relating trouble indications with 
corresponding faulty plug-in package, had to be produced. The system 
itself was utilized to create this dictionary. Over 50,000 known faults were 
purposely introduced into the central control to be diagnosed by its diag- 
nostic program. The corresponding test results were then recorded via a 
high speed output. Finally, these test data were sorted and printed in dic- 
tionary form by a computer. 


I. INTRODUCTION | 


In November, 1960 Bell Laboratories started its field trial of an 
experimental electronic telephone switching system in the town of 
Morris, [llinois. This system (extensively described elsewhere)! was 
one of the first attempts to introduce electronics on a large scale into 
telephone switching and as such, brought us face-to-face with a new 
class of problems, especially in the field of maintenance of centralized 
telephone equipment. 

The problems of maintaining an electronic telephone switching sys- 
tem are formidable, but as will be seen presently, the tools naturally 
available for this maintenance are powerful. This paper will deal with 
the problem of maintaining the most complex portion of the experi- 
mental electronic telephone switching system, the central control. 


1177 
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II, RELIABILITY AND MAINTENANCE OBJECTIVES 


The character of a commercial telephone system as a whole imposes 
unusual maintenance objectives for its component parts. The vital role 
of a central office demands that it have an extremely low downtime. 
At the same time, since the telephone system is so widespread and 
cannot be concentrated in a few key locations, another objective is that 
it be maintainable by telephone system craftsmen. 

These are extremely difficult requirements. In order to maintain 
a sufficiently low downtime with devices currently available, it is neces- 
sary to provide some redundancy in the equipment so that single trou- 
bles do not cause the entire system to fail. The simplest form of re- 
dundancy, and in the present state of the technology probably the least 
expensive, is the simple duplication of all common equipment in the 
system. Thus, where only one memory store is required for running the 
system, two are provided; where only one control unit is required for 
running the system, two are provided, etc. 


III. DESCRIPTION OF THE MORRIS ELECTRONIC SWITCHING SYSTEM 


Fig. 1 is a block diagram of the system. The customer’s line comes 
into the office and has an appearance on the network and on the scan- 
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Fic. 1—General system block diagram. 
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ner. The appearance on the network permits him to be connected to 
another. customer, to dial tone, or to a ringing signal. The appearance 
on the scanner permits the system to detect the status of his line. 
When a customer lifts his telephone handset off the cradle, a loop is 
closed to the central office, changing the voltage at a point that is de- 
tectable by the line scanner. Similarly, when dialing, this loop is opened 
for brief intervals; for example, if a 9 is dialed the circuit is opened nine 
times. 

The flying spot store is used to store the program of the system and 
the individual translation information associated with each customer. 
The barrier grid store is used to store information as to the status of 
the lines in the office (busy, idle, or dialing) and to assemble dialing 
information during the time that a customer is actually dialing. The 
signal distributor is used to operate the test relays in the office and to 
signal to distant offices. It is also used to switch between working and 
standby units. The central control must coordinate the actions of all 
these units, i.e., take the output of the line scanner and the barrier 
grid store, act upon these outputs according to the instruction given by 
the flying spot store, and use these results either to set up a connection 
in the network or write further data in the barrier grid store; it must 
also request the next instruction from the flying spot store. 

These facilities can be used for testing the system in a relatively 
sophisticated manner. Available for the purposes of setting up a call is 
a rather complex system for processing data. Testing is also actually 
a data-processing action. Thus, when a faulty unit is being tested, we 
get the instructions for testing from the flying spot store and we get 
the test results either from match circuits between units performing 
supposedly identical functions or from the line scanner which has test 
probes into various circuits. Test results are then assembled in the 
barrier grid store and are eventually typed out using the teletypewriter 
(which is also under the control of the signal distributor). 

- As mentioned earlier, all important and common equipment is dupli- 
cated. This leads to a two-fold advantage for testing. First, a rather 
sophisticated data-processing machine is always available which can 
automatically apply tests, and interpret and report test results. Second, 
an identical unit, presumably in good working order, is always availa- 
ble; the output. of this unit may be compared with that of the circuit 
being tested.:A match of outputs indicates a successful performance of 
a test, while‘a mismatch indicates a: test failure. 

The program for controlling tests is stored in the flying spot store. 
Storage space in this store is both expensive and limited, Therefore, it 


1180 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


is not possible to store a program sufficiently flexible so that it can 
print out the identity of a defective package. Instead, the program 
prints out test results and these test results are then compared with 
test results that are anticipated by the designer and are placed in a form 
referred to as the ‘‘maintenance dictionary.” 


IV. CENTRAL CONTROL MAINTENANCE 


The central control is the basic control unit of a real-time, special- 
purpose, program-controlled data-processing machine. It controls the 
flow of information to the stores, markers, scanner and signal distribu- 
tor. It is duplicated. There are over 8000 circuit packages (6500 tran. 
sistors, 45,500 diodes) in both central controls. The two central controls 
normally operate in parallel, executing the same instruction and _ per- 
forming the same operations, even though only one of the controls, 
the “active” central control, is used to control such system output cir- 
cuits as the markers and signal distributor. Certain key outputs of the 
two central controls are matched; trouble in either central control leads 
to a mismatch. Any mismatch initiates a special fault-check program, 
which is used to decide which, if any, of the central controls has the 
trouble. The program causes central control to make a number of de- 
cisions; if the active central control makes any incorrect decisions, it 
switches itself out of service. If a decision made by the active central 
control is correct but a mismatch occurs, it indicates a trouble in the 
standby central control. In case the active central control is defective 
to such a degree that it cannot execute the fault-check program, a time- 
out circuit will automatically switch this control out of service. 

After the fault-checking program has been completed, a diagnostic 
program is started. Whenever the system finds a spare time period of 
one millisecond when no telephone operations are required, tests are 
performed on the suspected central control. The results of these tests 
are then typed out using the system teletypewriter. These results must 
then be interpreted with the aid of a maintenance dictionary. 


V. THE CENTRAL CONTROL MAINTENANCE DICTIONARY PROBLEM 


For most of the units of the system, a maintenance dictionary can be 
specified with a reasonable amount of effort by the designer of the test 
programs because the units are functionally comparatively simple. For 
example, the characteristics of different types of faults which may occur 
in the scanner lead to a relatively small number of simple types of pat- 
terns which can be readily examined. However, central control is much 
more complex both in its functions and in its circuitry. The type of 
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symmetry which is characteristic of scanner, signal distributor, network, 
and even the stores is totally absent in the case of central control. Even 
the number of the tests which has been selected (about 900) attests to 
this difference of complexity. The work of preparing a dictionary by 
hand would have been formidable and the resulting dictionary would 
have been very incomplete. As a result, the automatic means described 
in this paper were used to produce a dictionary that was both as com- 
plete as the diagnostic tests would permit and which required only a 
reasonable amount of development effort. 

The scheme was to introduce about 50,000 faults into central control 
and get the test results associated with each fault. These test results 
could then be sorted and finally printed as the desired dictionary. 

This scheme had a number of advantages over other techniques. 
Tirst of all, while considerable effort would be required to design the 
basic mechanism and circuits for carrying out this scheme, once this 
effort had been expended, the number of troubles which could be ana- 
lyzed could be made very large without enormous additional expendi- 
ture of effort. Secondly, no errors of analysis could creep into the sys- 
tem. Thirdly, because computer analysis would be required for the final 
production of the dictionary anyway, it would be possible with little 
additional effort to create a dictionary printout format that would 
closely resemble the format of the test results which are normally ob- 
tained by the system. Finally, the actual process of creating the dic- 
tionary could be deferred to a relatively late date so that the dictionary 
would be based on the most up-to-date version of central control and 
the most up-to-date version of the diagnostic program. 

The basic scheme for deriving the dictionary data is discussed below. 
First, the system must be switched to a special dictionary mode of opera- 
tion, dropping all telephone work. (Since this is done in the laboratory, 
not in a working office, this is not at all serious.) Then, information is 
fed to the system concerning the identity of the package whose possible 
faults are to be simulated. Next, the faults are simulated. After each 
fault, the system diagnostic program is started; a punched paper tape 
output is used to record the identity of the package, the number of the 
fault, and a complete report of the tests that failed when that fault was 
simulated. Such an output record is generated for each fault that is 
simulated. These records are then sorted and printed in suitable form 
by a computer. 


VI. GENERAL DESCRIPTION OF DICTIONARY PREPARATION 


A number of special pieces of equipment were designed for the prepa- 
ration of the dictionary. Two fault-simulation units were used to simu- 
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late essentially all the packages and their faults. A test control unit 
was used to sequence the faults automatically in a given package and 
coordinate all special recording equipment with the system. 

Information identifying the package whose faults were to be simu- 
lated was stored on a regular 5-channel teletype tape and was automati- 
cally fed into the system at the proper time by a conventional 100 
words per minute reader when the corresponding circuit card was to be 
tested. 

A high-speed 1000 words per minute (100 characters per second) 
TELETYPE tape punch was used to record the test results. The output 
data for each fault consists of the package identity and its location, the 
fault number, and the test results. 

Tig. 2 shows the functional block diagram and peripheral equipment 
involved in collecting the test data. 

The test control unit controls the over-all operation of the fault 
simulation and data gathering. The “normal-test’”? switch on the test 
control unit is first operated to the test position. This requests the sys- 
tem to go into the dictionary mode. When the system has reached a 
convenient point in its program, it stops the central control clock and 
turns over the control to the test control unit. The system cannot start 
again until a signal is received from the test control unit. 

When the “‘system-off” lamp on the test control panel is turned on, 
the package on which faults are to be simulated can be pulled out and 
replaced by the fault-simulation unit. Then the “automatic test’? push- 
button on the test control unit is operated. This tells the system to 
start an automatic test. 

As soon as the automatic test signal is received, the system requests 
the package information tape reader to read in one package identity. 
Since the sequence of packages to be tested is the same as that listed on 
the package information tape, the identity will be the package currently 
under test. This information is stored by the system and later affixed to 
the corresponding diagnostic test results. 

When the package identity has been stored, the system commands 
the test control unit to switch in the first fault and immediately initiates 
a complete diagnosis. Upon completion of the diagnostic tests, the sys- 
tem delivers the final test results to the high-speed tape punch. The 
system then requests the next fault to be switched in and another round 
of diagnostic tests and data punching begins. 

The same cycle of operation is repeated until the fault-simulation 
unit informs the test control unit that the last fault on this package has 
been tested, in which case the test control unit puts the fault-simulation 
unit in a no-fault condition and asks the system to carry out the same 
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Fic. 2—Dictionary data acquisition: functional block diagram. 


diagnostic routine. The test result output channel is now switched from 
the high-speed tape punch to the existing system teletypewriter, so 
that the test results will be easily readable. Normally, it is expected 
that there will be no test failures at the no-fault condition. Therefore, 
the teletypewriter will print out only the ALL TESTS PASSED message. 
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The main reason for making such a test is to check the correct func- 
tioning of the fault-simulation unit, i.c., that the substitution of the 
fault-simulation unit does not itself introduce a fault. The test also 
provides an auxiliary means to monitor the proper operation of the 
system. 

When diagnosis for the no-fault condition is completed, the system 
performs the routine tests on all major system units; this enables the 
system to detect any trouble. After the system has succeeded in going 
through all the routine tests, it again turns itself off and lights the “‘end 
of automatic test’? lamp on the test control unit panel. This completes 
the fault simulation for one package. The entire operation from the 
first fault to the last fault, then to the no-fault state, is done automati- 
cally following a single operation of the “automatic test’? pushbutton. 

During the time that the dictionary data was being gathered, a modi- 
fied system program was used. The system did not spend a large amount 
of time performing unnecessary telephone work while it was actually 
creating the central control dictionary. 

A high-speed 1000 words per minute TELETYPE reader was em- 
ployed to read the tape as soon as it was punched. On the paper tape 
seven channels were used, six for the test data and one for the lateral 
parity bit generated by the system. The output of the reader was fed 
to a tape error detection unit which checks the parity of each charac- 
ter and the block length of the test records. The number of characters 
in the test record for each fault should be the same. 

Before the data could be sorted, they were first converted from the 
punched paper tape to magnetic tape. A computer was then used to 
sort the test data. The actual dictionary was printed directly by a tape 
controlled printer. 

In the dictionary project, work was divided into several major parts. 
A brief description of each part is given in the following sections. 


VII. FAULT SIMULATION 


The two central controls in the system are identical. Normally, one 
serves as the active unit and the other one as the standby or vice versa. 
The active unit always diagnoses the one in trouble. Therefore it is 
necessary to insert faults in only one of the central controls. The total 
number of circuit packages in which faults have to be simulated is about 
4000. 

About 49 different types of packages are used in central control. 

All the faults simulated are of the catastrophic type. Faulty diodes 
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are simulated as being either shorted or open, resistors as open and 

transistors are simulated as either permanently on or off. 

_ Only single troubles are simulated. It would be totally impractical to 
simulate multiple faults. 'urthermore, routine tests (at 100 millisecond 

and 1 second intervals) and matching between two central controls are 

performed often enough so that it is reasonable to assume that a single 

trouble will be detected and diagnosed before another fault develops. 

Obviously it would be equally impractical to simulate all marginal 
conditions. It is hoped that a majority of the marginal conditions, if 
they result in trouble at all, will give the same characteristic result as a 
corresponding catastrophic fault. For example, if the reverse imped- 
ance of a diode is too low, a gate may behave in the same manner as if 
the diode were shorted. 

The number of faults per package varies from 2 to 30. Over 50,000 
faults were simulated in order to create the dictionary. 

In the packages containing only diodes, troubles are simulated for 
each gate; in the other type of packages, mainly those containing tran- 
sistors, only faulty output conditions are simulated. The results of the 
diagnostic tests are based on the output of a package rather than the 
individual component contained therein. 

An example of fault simulation for a 2-input OR gate is given in lig. 
3. Each diode is shorted and opened by the operation of a different 
relay. Only an open resistor is simulated. If a shorted resistor were 
simulated it might cause damage to other components in the system. 
TFurthermore, shorted resistors are relatively rare trouble conditions. 
The fault-simulation unit for the AND gate is similar, except that no 
resistor is involved. 

The transistor packages are simulated differently. For example, four 
possible faulty output conditions are simulated in a transistor flip-flop 
circuit. 

1. Set permanently high, reset permanently low. 

2. Reset permanently high, set permanently low. 

3. Both high. 

4, Both low. 

All faults are controlled by the operation of different relays. Fig. 4 
shows how this is done. The same simulation technique is used for all 
other types of transistor and miscellaneous circuit packages. 


VIII. MODE OF OPERATION 


When test data was being collected in the preparation of the diction- 
ary, the system operated in two basic modes: normal and testing. 
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Fig. 3—Fault simulation on OR gate. 


In the normal mode the dictionary test equipment is effectively dis- 
connected from the system and the system is allowed to operate in its 
normal condition. If a trouble other than the one intentionally intro- 
duced is detected in the system during the course of a test, the system 
is placed in its normal mode so that any faults may be diagnosed and 
corrected. 

In the test mode, the following different tests can be initiated by 
operation of appropriate push buttons: 

1. Automatic test. This has already been described under the general 

description. 

2. Manual test. The system can be asked to conduct diagnosis on any 
desired fault including the no-fault case. The results of a manual 
test are recorded on the system teletypewriter. Fault simulation 
is accomplished by means of the ‘fault advance”? pushbutton on 
the test control unit. The manual test, made from time to time, 
is used to check the special dictionary programs and equipment. 
The manual test data are also used to spot-check the output data 
on the paper tape. 
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Fig. 4—Fault simulation on transistor flip-flop. 


3. Skip package identity. This is a special feature provided so that 
a package identification block stored on the package information 
tape can be bypassed. When the “skip-package” button is operated, 
the system controls the tape reader and advances the package 
tape to the next block. This provision is made so that crrors made 
during the preparation of the information tape can be corrected. 


IX. SYSTEM CIRCUIT REQUIREMENTS 


All communications between external equipment and the system are 
accomplished using the scanner and signal distributor. The scanner 
provides central control with an information access to lines, trunks, 
and test points. Signal-distributor outputs provide access from the 
central control to one of a large number of outputs. Very minor modifi- 
cations in the system were needed to prepare the system to create the 
dictionary; the modifications consisted of adding a few temporary cables 
and connectors. The only wiring required was the jumper wiring from 
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the external equipment to those scanner and signal-distributor points 
used for the dictionary project. Altogether, 12 scanner points and 23 
signal distributor points were used. 


X. PACKAGE INFORMATION TAPE 


The package information tape records the type and location of each 
package to be tested. This information is read into the system via a 
conventional 100-speed (10 characters per second) tape reader; the iden- 
tity is read once for each package to be tested. The system controls the 
tape reader under the command of the test control unit. 

Only the pertinent codes associated with the package identity are 
stored. The system rejects all functional codes such as space, carriage 
return, etc. This package information is later attached with its associ- 
ated diagnostic test results in the final output for each fault. 

Eleven characters are assigned for the identification of each package. 
The first five specify the location, the next six indicate the type of pack- 
age. 

The package information was first tabulated at random manually 
from an apparatus designation chart. IBM cards, one for each package, 
were prepared from the tabulated list, and were manually checked. 
After the cards were sorted according to the predetermined order in 
which the packages were to be tested, a printed list was prepared. This 
list was used as the master package test schedule during data gathering. 
The punched cards were then converted into a fully perforated, 5-chan- 
nel paper tape in TELETYPE code. From this tape, a final printed, 
chadless tape was prepared. A printed tape was used so that the opera- 
tor would be able to read it. Thus, a check could be made before a 
package identity was read into the system. A printed tape so obtained 
would go through a final check. 


XI. SYSTEM PROGRAM MODIFICATIONS 


Modification of existing programs, mainly the central control diag- 
nostic programs, and the flow charting and coding of a number of new 
program segments were required. About 1000 new program words were 
added as a result of the changes and additions made. Modifications 
were necessary in order to have the system operate in different modes 
and perform different tasks. 

During the period of dictionary preparation, the system had two 
additional teletypewriter communication channels: one input channel 
from the conventional 100-speed tape reader, for reading the package 
information into the system, and one output channel to the high-speed 
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tape punch for test data recording. In addition, the existing teletype- 
writer output was retained for conveying auxiliary information to the 
test team. 

Most of the new program segments were for data handling, coordina- 
tion of different test modes, control and reading of the package in- 
formation tape, recording of test data on the high-speed punch, ete. 


XII. PUNCHED PAPER TAPE TO MAGNETIC TAPE CONVERSION 


The sorting of the test data was done on an IBM-704 computer. The 
test data recorded on paper tape in binary code had to be converted 
into magnetic tape, compatible with the 704. Conversion was made on 
an IBM-9200 machine. Information was processed at the rate of 500 
characters per second. 

The magnetic tape recording was an identical image of the paper 
tape. The code conversion was somewhat different from conventional 
conversions. In the 7-channel paper tape, the test data utilizes all 64 
combinations of the first 6 channels, and the 7th channel is used for an 
odd parity bit for each character. 


XIII. DATA RECORDING AND PROCESSING 


13.1 Data Recording 


Over 900 central control diagnostic tests are grouped into eight 
phases. Each phase diagnoses faults of certain parts of the central con- 
trol. 

During the normal diagnostic operations, the test results are recorded 
at the end of each phase. The printout consists of two parts. The first 
part is the system component identification, i.e., whether it is central 
control 0 or central control 1, together with the phase number (A, B, 

- or H). The second part is the test results. Only the numbers of 
those tests that fail are printed out. Each has a 3-digit octal number. 
The test results, therefore, are variable in length, depending on how 
many tests have failed. 

For dictionary preparation, a binary coding system was adopted for 
recording test data. Every test in the diagnostic program was repre- 
sented by one bit on the paper tape. This was also true for the magnetic 
tape, whose recording was identical to the paper tape except with 
higher longitudinal density: 200 versus 10 characters per inch. A “1” 
or “hole” means that the corresponding diagnostic test has failed and a 
“0” or ‘‘no hole” indicates that the test passed. Each character consists 
of 7 bits, 6 for registering six different test results, and 1 for parity. The 
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basic advantage of using this binary coding system is that it makes 
the sorting process easier. The number of characters in the test record 
for each fault.is the same regardless of the number of test failures. 


13.2 Data Processing 


The data processing for the dictionary was done by an IBM-704 
computer and the actual printing of the dictionary by a high-speed 
tape-controlled printer. 

The 704 program is quite complex and involved. It can be divided 
into three major parts: phase sorting, test sorting, and data printing. 
The entire program is about 2500 words. 

In the central control dictionary, all diagnostic test results were ar- 
ranged in an orderly manner so that, given a certain sequence of test 
failures, the maintenance man could easily look for the same sequence 
in the dictionary. Associated with each sequence, one or several package 
identities and locations are listed. When several appear, failure of any 
one of them could result in such a test failure sequence. 


13.2.1 Phase Sorting 


All test records were sorted first according to their phase information. 
The test record for each fault contains three parts: 

1. The package identification and fault number. 

2. The individual test results. 

3. Phase information, which is represented by 8 bits, one for each 

phase. If all tests have passed in a phase, the corresponding bit 
will be 0; if one or more tests have failed, the bit will be 1. 

The purpose of this phase sorting is to arrange all records according 
to the alphabetical order given by the phase information. For example, 
consider a record a with phase information 00101100 and a record 8 
11000100. The 8 bits represent phases A, B, C, D, E, I, G, H. Omitting 
all zeros (that is, those phases which do not have any test failures), 
phase information for record a becomes CEF and record 8 ABI. After 
the phase sorting, record 8 will be placed in front of record a, that is, 
ABF in front of CEF, as are the words in a regular dictionary. Sorting 
by phase involves only part 3 of the test record. 


13.2.2 Test Sorting 


After the test records were sorted by phase information, they were 
subsorted in accordance with test failures. This was necessary because, 
with 255 possible phase combinations and over 50,000 different records, 
many records have the same phase combinations. The test sort takes 
all the records with identical phase information and further arranges 
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them in a numerical order according to the binary test results which 
are given in part 2 of each test record. The actual test results consist 
of 916 bits without counting the added dummy bits. Dummy bits were 
added to make up a complete 704 machine word (36 bits) required by 
the computer. Each bit corresponds to a particular test. A ‘1’ or “hole” 
means that the test which the bit represents has failed. 

As an example, assume three records, a, 8, and y, having identical 
phase information and, for simplicity, only 6 diagnostic tests. The test, 
failure information on these records is the following:a = 00101 1, 
6B=110010,andy = 01000 O. The leftmost bit corresponds 
to test No. 1. Therefore, tests No. 3, 5, and 6 have failed in record 
a; 1,2, and 5 in 8; and 2 in y. After test sorting, the three records will 
be arranged in this order: 


Record Binary Test Data ee Pete eeciehs 
B 110010 Les ABE 

¥ 0 10 0 0 0 2 

a 001011 3, 5, 6 CEF 


It can be seen that the binary test data is so arranged that the analo- . 
gous alphabetic sequence is in true alphabetic order. 


13.2.3 Dictionary Printing 


The printing of the actual dictionary was done by an IBM high-speed 
tape-controlled printer. When all the records were properly sorted by 
phase and test information, they were converted and printed in a dic- 
tionary form. 

The test data collected for the dictionary was in binary form. This 
was converted to the form used by the system in typing out test results. 
Whenever a bit is ‘11”’, the location of that bit in the test recording indi- 
cates the phase and test number of this test failure. This is converted 
into a 3-digit octal number. The test data format in a dictionary, there- 
fore, looks exactly like the system teletypewriter printout in a normal 
diagnosis. Identical test data produced by different faults is entered 
only once in the dictionary, but all the faults are listed. Fig. 5 shows a 
sample sheet of the dictionary. 


XIV. EQUIPMENT EMPLOYED 


14.1 TELETYPE Equipment 


14.1.1 Highspeed Tape Punch and Receiving Signal Converter 


A 1000 words per minute (100 characters per second), tape punch 
was used to record test data from the system. It was equipped for punch- 
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Fig. 5—A sample sheet of the Dictionary (with test results and address information identified). 
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ing eight channels, although only seven channels were used. The high- 
speed tape punch is a rotating device, which can be operated only at a 
particular point in the cycle. The receiver signal converter acts as a 
buffer between the system and the punch. This equipment has coded 
input circuits, consisting of two wires each, so arranged that informa- 
tion appearing on these leads will be punched on the tape at the proper 
point in the punch operating cycle. Since the punch can accept informa- 
tion only at a certain point in a cycle, a buffer store is included in the 
receiving signal converter, where the coded signal is stored until the 
punch is ready to accept it. 


14.1.2 High-Speed Tape Reader 


The reader operates at a speed of 1071.4 words per minute (107.14 
characters per second). It is a parallel output device, equipped for eight- 
channel operation. The reader was used in conjunction with the tape 
error-detection unit to check the output paper tape. 


14.1.3 Transmitter Distributor 


A transmitter distributor was used to read the package information 
tape. This unit operates at 100 words per minute (10 characters per 
second) and is equipped with a set of five parallel code-reading contacts. 
These contacts were used to feed the code signals to the system. 


14.2 Equipment Built 


14.2.1 Test Control Unit (F7g. 6) 


The test control unit was the master control for the dictionary tests. 
This unit coordinated the operation of the system and all peripheral 
dictionary equipment. Since the operating time of any type of logic 
employed in this unit was insignificant compared to the total time con- 
sumed in changing the packages, recording the test data, loading, and 
unloading paper tapes for the punch, etc., relay logic was used. The 
circuits were so designed that any improper operation of pushbuttons 
and control levers did not cause erroneous data to be recorded. 

The test control unit performs the following major functions: 

1. Controls the system clock: when the system clock stops, it will 

not start again until a command is received from this unit. 

2. Governs the reading of the package information. tape. 

3. Communicates with the system and provides instructions for per- 


1194 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 





Fria. 6—Left: fault-simulation unit for gate packages. Center: test control unit. 
Right: fault-simulation unit for transistor packages, with flip-flop package 
plugged in. 


forming an automatic or manual test or a package identification 
skipping operation. 

4, Instructs the fault-simulation unit as to when a particular fault 

by itself or as a part of a series of faults should be simulated. The 

simulation and sequencing of faults during an automatic test are 

accomplished automatically. . 

Allows manual insertion of any desired faults with the aid of the 

fault reset button, fault advance button, and fault number indica- 

tor. 

6. Provides visual indications for the following: (a) mode of opera- 
tion: (test or normal), (b) which fault is being tested, (c) occur- 
rence of an unsimulated trouble in the system, (d) completion of 
either an automatic or manual test or the skipping operation, (e) 
the off state of the system. 


or 


14.2.2 Fault-Simulation Unit 


A total of 49 different types of packages are employed in the central 
control units. A study of the type and number of faults to be introduced 
in all the circuit packages used revealed a logical division between gate 
packages and other types of packages, such as flip-flops, amplifiers, 
etc. Two fault-simulation units were designed which simulated essen- 
tially all the circuit cards in the central control. 

The fault-simulation units were wired to create the required circuit 
packages with the necessary fault-simulation features by means of a 
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plugboard arrangement. The plugboard consisted of a 150-pin connec- 
tor. The socket portion of this connector was wired to various compo- 
nents provided in the fault-simulation unit. For each circuit package, a 
separate plug was used which contained the necessary jumper wires to 
simulate the corresponding package. 

To minimize the stray capacitance, the fault-simulation units were 
laid out to be as compact as possible. Each unit was about 7” x 7” x 
4”, as shown in Fig. 6. Microminiature relays were used inside the units 
to switch the various faults. During testing, they were placed in close 
proximity to the package socket. The AND and OR gate fault-simula- 
tion unit consisted of 30 relays plus the components necessary to simu- 
late any one of the 16 types of gate package. 

The second fault-simulation unit was used for transistor and miscel- 
laneous packages. Most of the faulty conditions for these packages were 
simulated on the output terminals; therefore, only a few types of circuit 
cards needed to be wired up from components provided in the unit 
itself. In the majority of cases, a good package was plugged into a 
socket supplied on the fault-simulation unit, and the conditions on the 
output terminals were controlled by relays inside the unit. As in the 
first unit, plugboard connectors were provided for each simulated type 
of circuit packages. 

The sequencing of faults was controlled by the test control unit. A 
signal from the fault-simulation unit was sent to the test control unit 
when all the faults were completed for a package. 


14.2.3 Tape Error Detection Unit 


The tape error detection unit was designed for checking the punched 
paper tape. The code signals were fed into this unit in parallel by the 
high-speed tape reader. Two types of tape errors were detected by this 
unit: parity of each character, and predetermined block length of test 
data. The output tape was checked as soon as it was punched. 


XV. DETAILED PLANNING OF TEST PROCEDURES 


In order. to take up a minimum amount of system time to create the 
dictionary, careful plans were made. The plans included the detailed 
test procedures, manpower requirements, job descriptions for the 
operators, and the equipment operation instructions involved in the 
acquisition of test data. Suggested procedures were also given in case 
of trouble during the dictionary test. All conceivable major and minor 
troubles were analyzed. The effort spent in planning was worthwhile. 
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EXPERIENCE WITH MORRIS CENTRAL CONTROL MAINTENANCE > 


The following points summarize our experiences: 


1. 


Certain troubles were not detected by the diagnostic program. 
These can be attributed to one of the following: 


(a) The particular component in which the trouble was simulated was 


not actually used by the system. 


(b) There was some redundancy in the circuits, creating the possi- 


(c) 


(d 


2. 


~ 


bility of troubles that could not be detected. 
The diagnostic program was not exhaustive. Circuits in certain 
areas have not been covered. Additional tests could have been 
added to the present diagnosis, but it was felt that such an effort 
was not justified for the Morris System. 
Due to the design of the Morris central control, there were limi- 
tations which made it impossible to detect certain troubles. For 
example, a shorted clock diode on an AND gate usually resulted 
in no test failures. The low impedance of the clock supply pre- 
vented the clock pulse from being clamped. However, the margins 
were decreased and occasionally a test failed when this type of 
trouble was introduced. An open clock diode on an AND gate at 
the input to a flip-flop which had no feedback around it usually 
resulted also in no test failures. The flip-flop merely changed 
state, without waiting for the clock pulse which normally initiated 
the change. However, some flip-flops could be operated falsely 
by the noise on the gate leads which could occur before the clock 
pulse arrived. 
Some troubles resulted in inconsistent test results. Most of these 
were due to circuit design. For example, certain troubles intro- 
duced caused complementary functions to be performed simul- 
taneously, such as writing a “0” and a “1” in a memory spot. 
These troubles produced race conditions in the central control 
logic, resulting in inconsistent test results. 


. Some troubles introduced in the standby central control affected 


the operation of the active system. This indicated that better iso- 
lation was required between the two central controls. 


. Many test results were extremely difficult to analyze. Since there 


are some 900 tests made on each fault, even sketchy analysis is 
time consuming. It is very difficult to explain why certain tests 
fail with respect to trouble introduced and to predict all the tests 
that should fail. For these reasons, the dictionary is extremely use- 
ful. Even now, if a set of test failure results is not found in the 
dictionary, analysis presents grave problems. 
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5. The fault-simulation units introduced sufficient capacitance in the 
circuit to cause marginal operation in some of the circuits. By test- 
ing most transistor packages in a fixed state (that is, a flip-flop 
permanently set, etc.), and by reducing the value of diode gate 
resistors, it was possible to obtain meaningful data on all but seven 
packages in central control. These seven persisted, in spite of all 
efforts, in failing tests when the simulated package was plugged 
in and no actual faults introduced. 

6. It appeared evident that a central control dictionary cannot be 
written from an analysis of the circuit with respect to the diag- 
nostic tests. The complexity of the central control made it difficult 
to predict how the central control will perform with any given 
fault. 

7. Even if a dictionary were not produced, extensive trouble inser- 
tion or trouble simulation was necessary to debug and measure 
the effectiveness of the diagnostic program. 


XVII. SOME INTERESTING STATISTICS 


The central control maintenance program has about 7200 program 
words, 6000 of which are for diagnosis. Over 50,000 faults were simu- 
lated. About 250 hours of actual system machine time were used ex- 
plicitly for data gathering. The total data (about 60 million bits) occu- 
pied 93 reels of 1,000 foot, 7-channel paper tape. These in turn were 
converted by an IBM-9200 machine into four reels of magnetic tape. 
The conversion time was about 12 hours. Out of 51,671 records, about 
35 were destroyed and 25 rendered doubtful because of machine errors. 

The sorting program for the IBM-704 computer is about 2,500 words 
long, and was written by one man in about eight months. The total 
sorting time on the IBM-704 computer was 34 hours. 

The time spent in designing the diagnostic program and producing 
the dictionary was about 12 man-years, of which two man-years was 
for designing the diagnostic program and two man-years was for de- 
bugging and modifying it. 

The dictionary consists of 1,290 pages (11” x 14%”), bound into four 
volumes. 


XVIII. DICTIONARY RESULTS 


There are 10,315 different test patterns, 73 per cent of which list only 
one possible package failure, 13 per cent two possible package failures. 
Therefore, for a large number of catastrophic troubles (86 per cent), 
the dictionary will be able to pinpoint the fault to within two packages. 
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Our experience indicates that a man with a few minutes of training 
will be able to consult the dictionary and determine the faulty package, 
usually within two to three minutes, sometimes as long as five to ten 
minutes, and this appears adequate. Our preliminary evaluation of the 
dictionary indicates that it will be able to locate about 75 per cent of 
the troubles. 


XIX. CONCLUSIONS 


The feasibility of producing a central control dictionary by the system 
itself has been proven, and a dictionary has been produced. Considera- 
ble experience has been gained with the maintenance of a large elec- 
tronic logic circuit. Asa result, a number of improvements can be made. 
A great deal has also been learned regarding the limitations of diagno- 
sis due to the central control circuit design. These limitations cause 
inconsistent test results or no test failures. With the improvements 
which can be made in the diagnosis and the circuit design, it appears 
feasible to have a central control dictionary which will be able to locate 
90 per cent or more of all the probable troubles. 

Using the dictionary techniques, the average repair time may be: 
kept very low, and the maintenance was made much easier. Success in 
this area of work has contributed greatly to meeting the initial main- 
tenance and reliability objectives.” 

Considering that this was an initial attempt to solve a very complex 
and difficult problem, the results have been gratifying. Considerable 
headway has been made in automatic diagnostic techniques. However, 
we must develop these techniques further if we are to cope successfully 
with the problems of maintaining the even more complex electronic 
telephone switching system now being developed. 
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APPENDIX 


Description of the Central Control Diagnostic Program 


A complete understanding of the central control diagnostic program 
demands an intimate knowledge of the central control. This appendix 
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will therefore only illustrate the types of tests used and the methods 
for observing test results. 

Tor clarity, it 1s desirable to define a simplified repertoire of instruc- 
tions. A, B, C, D, AA, AB refer to symbolic designations for flip-flops, 
flip-flop groups, or storage locations. 


SM Sample match circuits for a mismatch condition dur- 
ing the process of executing the next instruction. 
G A,B Gate the contents of flip-flop group A to flip-flop 


group B, via bus. 
ST1 AA Set up transfer register 1 (T1) to quantity AA. 
ko, AB Transfer to the address stored in transfer register | 
R1, AB if the reading at AB is 0 or 1 respectively. 
RFFO, C Transfer to the address stored in transfer register | 
RFF1, C if F FC is 0 or 1 respectively. 
WIFE C, D Write the contents of flip-flop C into memory at ad- 


dress D. 
WO, AB Write 0 or 1 respectively into memory of address 
W1, AB AB. 


Match circuits are provided to compare the outputs of the instruc- 
tions to the stores (transfer or advance to next instruction for the fly- 
ing spot store, read or write 0 or write 1 to the barrier grid store) and to 
match the busses of the two central controls. 

In the following examples, each program step is listed, followed by a 
symbolic modifier and by comments. In studying these programs it is 
important to remember that the two central controls are working in 
synchronism, and that the working central control 1s addressing and 
writing into the stores. 

First, to check the ability to make decisions, the following program 
was applied: 


WO AB (AB is any convenient address.) 
SM 
Ri AB 


If a mismatch is detected on the R1 instruction, the standby central 
control has falsely transferred on a reading. The program 


W1 AB (AB is any convenient address.) 
SM 
Rl AB 
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will check the ability of the standby central control to transfer on a 
reading. 

In order to check the ability to write correctly, register R was first 
set up, to all 1’s. Register R has the property that its individual flip- 
flops may be written into memory using the WIF instruction, or may 
be read using the RFFO and RIF 1 instructions. 

The program: 


ST1 All ones 


G T1,R 
SM 
Wiki C, D Any flip-flop C, of register R, to any convenient 


address D. 


In order to check the flip-flop groups of central control, the following 
program was used: 


ST1 All zeroes 


G Tl, A (A is the flip-flop group being tested.) 
SM 
G A, R. 


As previously mentioned, individual flip-flops of register R may be 
examined. If a mismatch has occurred, the proper flip-flop may be iso- 
lated by repeated use of the two instructions SM, RFFO, followed by 
a check to see if the flying spot store order-match circuit indicated a 
mismatch on the RIFFO instruction (the instruction which followed SM 
was sampled for a mismatch); by checking all the flip-flops in R, the 
flip-flops of A which were not capable of being set were detected. R and 
Tl were previously checked to make certain that all their flip-flops 
could be set and reset. 

The above programs are typical. Common subroutines were used to 
record the results of tests in the barrier grid store and to control the 
typing out of these results with the teletypewriter. 
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Heuristic Remarks and Mathematical 
Problems Regarding the Theory 


of Connecting Systems 


By V. E. BENES 
(Manuscript received February 1, 1962) 


A connecting system consists of a set of terminals, a control unit for 
processing call information, and a connecting network. Together, these 
three elements provide communication, e.g., supply telephone service, among 
the various terminals. In this paper we present a comprehensive view of the 
theory of connecting systems, an appraisal of its current status, and some 
suggestions for further progress. 

The existing probabilistic theory is reviewed and criticized. The basic 
features of connecting systems, such as structure, random behavior, com- 
plexity, and performance, are discussed in a nontechnical way, and the 
chief difficulties that beset the construction of a theory of traffic in large 
systems are described. It is then pointed out that despite their great com- 
plexity, connecting systems have a definite structure which can be very useful 
in analyzing their performance. A natural division of the subject into 
combinatory, probabilistic, and variational problems is drawn, and is 
illustrated by discussing a simple problem of each type in detatl. 


I. INTRODUCTION 


Mass communication long ago spread beyond the manual central 
office and assumed a nationwide character; it is presently becoming 
world-wide in extent. Many of the world’s telephones already form the 
terminals of one enormous switching system. The scale, cost, and im- 
portance of the system make imperative a comprehensive theoretical 
understanding of such global systems. 

Nevertheless, a lack of knowledge about the combinatory and prob- 
abilistic properties of large switching systems is still a major lacuna in 
the art of mass communication. It is a fact of experience that each time 
a new switching system is planned, its designers ask once again some of 
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the perennial unanswered questions about connecting network design 
and system operation: How does one compute the probabilities of loss 
and of delay? What method of routing is best? What features make 
some networks more efficient than others? Ete. 

The present paper is an informal discussion of problems in the theory 
of traffic flow and congestion in connecting systems (called traffic theory, 
or congestion theory, for short). The comments to be made are pref- 
atory, tutorial, and illustrative. They are intended as background for 
several papers of a more technical nature; one of these papers! appears 
in this issue, and the remaining three?*+ are to appear later. In these 
papers, topics touched on in the present work are considered in greater 
depth and detail. Together, the papers are an attempt to describe a 
comprchensive point of view towards the subject of connecting systems. 
I believe that this point of view will be useful in constructing a general 
theory of connecting networks and switching systems. What follows is 
then in part a prospectus for research to be reported on in the future. 

My concern in this paper is with some of the physical bases and princi- 
pal problems, with the fundamentals and difficulties, of the subject. I 
wish to emphasize some important properties and distinctions on which 
a systematic approach may be based. I am making a plea for a much 
more general, abstract, and systematic approach to large-scale congestion 
problems than has been envisaged heretofore. 

Naturally, it is impossible to explore all the consequences of such a 
comprchensive approach in one paper; I do not pretend to have solved 
even some of the basic problems of the theory. I am only saying “Look, 
perhaps these observations will help provide a general approach.” 

Iixamples and simple problems appear in the text as illustrations of 
the principal points made. For tutorial purposes, I have chosen particu- 
larly simple and clear illustrations, which may seem trivial to cognos- 
centi of traffic theory. Nevertheless, it has been my experience in talking 
with engineers that the comprehensive view here presented is sufficiently 
new to warrant clear, simple examples. More complex problems do not 
belong in an introductory work; they are to appear in later papers. ~ 


II, SUMMARY 


In Section III we give a historical sketch of traffic theory, which: is 
followed by a critique of existing theories in Section IV. The general 
properties of switching systems are discussed in Section V. The per- 
formance of switching systems and desiderata for a theory of congestion 
are considered in Section VI and Section VII, respectively. Sections V 
to VII are heuristic and nonmathematical in character. Mathematical 
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models are considered in a general way in Section VIII, while Section IX 
concerns itself with some of the basic difficulties and questions that 
arise In constructing a theory of traffic in a large-scale system. 

In Sections X and XI we show that, despite their great complexity, 
connecting systems actually have a definite structure which can be very 
useful in analyzing their performance. This usefulness is exemplified by 
four specific instances in Section XII. In Section XIII we make a general 
division of the subject into combinatory, probabilistic, and variational 
problems. The remaining sections, Sections XIV to XVI, are devoted to 
illustrating this division by working out a simple problem of each type in 
full detail. 


III. HISTORICAL SKETCH 


We shall not attempt to canvass systematically the literature of con- 
gestion theory. For the interested reader, the best single theoretical ref- 
erence on the theory of probability in connecting systems is undoubtedly 
the treatise of R. Syski;® the historical development of the subject has 
been described in papers by L. Kosten’ and R. I. Wilkinson.? Neverthe- 
less, we include a brief account of previous work in order to substantiate 
our critique (Section IV) of present theories of traffic in connecting sys- 
tems. 

The first contributions to traffic theory appeared almost simultane- 
ously in Europe and in the United States, during the early years of the 
20th century. In America, G. T. Blood of the American Telephone and 
Telegraph Company had observed as early as 1898 a close agreement 
between the terms of a binomial expansion and the results of observa- 
tions on the distribution of busy calls.* In 1903, M. C. Rorty used the 
normal approximation to the binomial distribution in a theoretical at- 
tack on trunking problems, and in 1908 E. C. Molina improved Rorty’s 
work by his’ (or Poisson’s) approximation to the binomial distribution. 

In Europe, the Danish mathematician A. K. Erlang, from 1909 to 
1918, laid the foundations of the first dynamic theory of telephone traffic, 
which is in general use today. Perhaps influenced by statistical mechanics, 
Erlang introduced the notion of statistical equilibrium, and used it as a 
theoretical basis for deriving his now well-known loss and delay formulae. 
An account of Erlang’s work is given by Jensen.° 

T'rom 1918 to 19389 traffic theory developed in many directions that are 
(on retrospect) closely allied to specific problems that arose in the design 
of the automatic telephone systems that were coming into use, and in 


* Blood’s unrecorded work was reported by E. C. Molina and described by 
R. I. Wilkinson.? 
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related queueing systems. We mention only a few topics: T. Engset?® in- 
troduced the notion of a finite number of sources of traffic, G. F’. O’Dell4 
published a classical paper on gradings, C. D. Crommelin” studied con- 
stant holding-time delay systems with many servers, E.. C. Molina! made 
contributions to trunking theory. IF’. Pollaczek" and A. I. Khinchin'® 
studied the queue with one server, and derived the delay distribution 
that bears their linked names. Pollaczek has also solved single-handedly 
many other difficult loss and delay problems. All these important con- 
tributions are concerned with congestion in specific parts of connecting 
systems. During this period, T. C. Fry wrote the first systematic and 
comprehensive book" on applied probability; this book devoted a chap- 
ter to telephone traffic, and appeared in 1928. 

Between 1939 and 1948 there developed an increasing awareness 
(among workers in traffic theory) that the mathematical bases of traffic 
theory were closely related to the modern theory of stochastic processes 
initiated by A. N. Kolmogorov” in 1933. In particular, Erlang’s idea of 
statistical equilibrium was identified with the stationary measure of a 
Markov process (or more generally with a semigroup of transition prob- 
ability operators). Also, C. Palm!’ stressed the importance of recurrent 
processes, and W. Feller!® that of birth-and-death processes, to traffic 
theory. However, particular problems continued to form the bulk of 
the new literature. Palm'® made a penetrating theoretical analysis of 
traffic fluctuations, and L. Kosten studied such topics as retrials for lost 
calls,?° and error in measurements of loss probability.” 

The introduction of crossbar switching and common control of con- 
necting networks in 1938 (see Ref. 22) was accompanied by a new kind 
of problem: calculating the loss due to mismatching of available links 
(rather than to unavailability of trunks). The first comprehensive treat- 
ment of loss in such systems was given by C. Jacobaeus”; his theory is 
adequate for practical purposes, but is based on assumed a priori dis- 
tributions for the state of the system. R. Fortet?4 has also made contribu- 
tions to this topic in the spirit of Jacobaeus’ approach. A less satisfactory 
method for the same problems based only on the possible paths for a call 
has been developed (independently) by C. Y. Lee?® and P. Le Gall.?® 

The statistical equilibrium approach to congestion in crossbar systems 
is rendered extremely arduous by the large number of possible states. 
The difficulties in this method have been faced with some success by K. 
Lundkvist?” and A. Elldin?. However, no practically feasible approach 
exists at present that simultaneously includes both the concept of sta- 
tistical equilibrium and the structure of the connecting network. A 
fortiort, no approach exists that also includes the effect of the common 
control equipment that places calls in the network. 
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IV. CRITIQUE 


In comparison with the highly sophisticated communications systems 
that are being built, the models and assumptions on which theoretical 
studies are based are often crude and fragmentary, almost more indica- 
tive of our ignorance than of the properties of systems. It may be argued 
that such a harsh appraisal of the condition of traffic theory is unjustified, 
and is disproved by the practical successes of current engineering meth- 
ods. However, it is not the efficacy of these methods, but their theoretical 
basis and scope, that we are questioning. Who knows to what extent 
present systems are “‘overdesigned”’? 

To be sure, measures of performance, loss and delay formulas, and 
routing methods are in daily use. Still, only in very special cases have 
they been investigated, let alone analyzed and understood in the full 
context of the system to which they are applied. Although the published 
literature on telephone traffic alone is vast, and many models and prob- 
lems have been considered, the existing theories tend to be incomplete 
and oversimplified, applicable to at most a small portion of a system. 
Useful comprehensive models are needed; to date, only individual pieces 
of systems have been treated with theoretical justice. As R. Syski re- 
marks on p. 611 of Ref. 5: “At the present stage of development... the 
theoretical analysis of the [telephone] exchange as a whole has not been 
attempted.” The general theory of switching systems now consists of 
some apparently unrelated theorems, hundreds of models and formulas 
for relatively simple parts of systems, and much practical lore associated 
with specific systems. It will stay in this condition until sufficient the- 
oretical underpinning is provided to unify the subject. We believe that 
this sad “‘state of the theory” is due largely to these three factors: 

(¢) The large scale, and consequent inherent difficulty of the problems. 
(iz) The absence of a widely accepted framework of concepts in which 
problems could be couched and solved. 

(it) The lack of emphasis on and success with the combinatorial as- 
pects of the problems. 

More generally, many of the basic mathematical properties of con- 
necting networks and switching systems have either never been studied, 
or, if studied, have not been digested, advertised, and disseminated for 
engineering use. As a result, the design and complexity of systems has 
consistently run ahead of the analysis of their performance. 


V. GENERAL PROPERTIES OF CONNECTING SYSTEMS 


We start by discussing some universal properties of connecting systems 
from the point of view of congestion, without reference to definite mathe- 
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matical models for their operation. Specifically, we describe, in a 
nontechnical way, (2) the general nature and outstanding features of con- 
necting systems, (77) the principal kinds of congestion that interest en- 
gineers, and (777) some of the difficulties and desiderata in both the theory 
and practice of large-scale switching. No mathematical abstractions are 
used at first. Some observations made may seem obvious or trivial; 
nevertheless, they are necessary for the general understanding that we 
desire. On these observations, we shall base a systematic division of the 
theory into three kinds of problems, combinatory, probabilistic, and 
vartational., 

By a connecting system we shall mean a physical communication ‘sys- 
tem consisting of (7) a set of terminals, (77) control units which process 
requests for connection (usually between pairs of terminals), and (772) 
a connecting network through which the connections are effected. The 
system is to be conceived as operating in the following manner: (1) calls 
(or requests for connection) between pairs of idle terminals arise; (2) re- 
quests are processed by a control unit, and desired connections are com- 
pleted, if possible, in the connecting network; (8) calls exist in the net- 
work until communication ends; (4) terminals return to an idle condition 
when a call terminates. (Naturally, the arising requests may “‘defect’’ at 
any point during the process of connection.) 

The gross structure of a connecting system is depicted in Fig. 1. Most 
modern connecting systems follow this basic pattern. Particularly im- 
portant examples are telephone central offices, toll centers, telegraph 
networks, teletypewriter systems, and the many military communica- 
tions systems. 

All the examples cited share three important properties. These are (7) 
great combinatorial complexity, (i7) definite geometrical or other struc- 
ture, and (27) randomness of many of the events in the operating system. 

It is obvious that many connecting systems are highly complicated. 










CONNECTING 


NETHIORK |TERM INALS 








CONTROL 
UNIT 


Fig. 1 — Connecting system. 
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Both the control unit and the connecting network contain thousands of 
parts which may (together) assume millions of combinations. That is, 
the system can be in any one of millions of possible “states.’’ These num- 
bers are increased when several switching centers are considered together 
as a unit, as in toll switching. Our purpose in calling attention to this 
complexity is to suggest that it calls for theoretical methods that, like 
those of statistical mechanics, are especially designed to distill important 
facts from masses of detail. 

It is less often realized, however, that this complexity is accompanied 
by definite mathematical structure, and is frequently alleviated by many 
symmetries. The control unit and the connecting network always have a 
specific combinatory, geometric, and topological character, on which the 
performance of the system closely depends. 

By imputing randomness to the systems of interest we do not imply 
that their operation is unpredictable; we mean only that the best way of 
describing this operation is by use of probability theory. It is not prac- 
tical, even though it might be possible in principle, to predict the opera- 
tion of a switching system by means of differential equations in the way 
that the flight of a rocket is predicted. However, differential equations 
have been used for many years to describe, not the motion of an actual 
system, but the changes in the likelihoods or probabilities of its possible 
states. Such equations govern the flow or change of probabilities and 
averages associated with the system, not the detailed time behavior of 
the system itself. It is in this weaker sense of assigning likelihood to 
various events that we can predict the behavior of switching systems, a 
fact first emphasized by A. K. Erlang’s pioneering work on telephone 
traffic.? For instance, certain features (such as average loads offered and 
carried) of telephone traffic that are predictable in this weaker sense form 
the basis on which toll trunking routes are engineered. 

We now turn to examples of the structure of connecting networks and 
of control units. The basic features of the connecting network for the 
No. 5 crossbar system are shown in a simplified form in Fig. 2. The net- 
work has two sides, one for subscribers’ lines and the other for trunks. 
Small squares represent rectangular crossbar switches, capable of con- 
necting any inlet terminal to any outlet terminal. These switches are 
arranged in groups called frames, either line link frames for subscribers’ 
lines, or (on the other side) trunk line frames for trunks. Frames are indi- 
cated in Fig. 2 by large dashed squares enclosing four small squares; dots 
indicate repetition. The pattern of links which interconnect the switches 
is shown by solid lines between small squares. At most one link connects 
any pair of switches. 
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CROSSBAR 
77 SWITCH 





LINE LINK FRAMES TRUNK LINK FRAMES 


Fig. 2 — Basic No. 5 crossbar network. 


As a second example of a connecting network, consider the three-stage 
Clos network (see Ref. 29) depicted in Fig. 3. The interpretation of this 
figure is the same as that of Fig. 2: small squares stand for crossbar 
switches, and lines between them represent links. Each call can be put 
into the network in m ways, one for each of the m switches in the middle 
column. This network has the property that if m 2 2n — 1, it is non- 
blocking. 

A control unit consists of parts that are arranged in a manner reflect- 
ing their function, and are determined by the operations necessary to 
establish a connection, and by the philosophy of design and the tech- 


nxm rxr mxn 





Fig. 3 — Clos three-stage network. 
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Fig. 4 — Simple control unit. 


nology that are basic to the system. To establish a connection, the control 
unit must do some or all of the following: (2) identify the calling party or 
terminal, (77) find out who the called party is, and (777) complete the con- 
nection. Three examples will be considered, in order of increasing com- 
plexity and modernity. 

A simple example of the structure of a control unit is given in Fig. 4. 
The unit consists of a dial-tone marker which assigns and connects avail- 
able idle registers to subscribers for dialing. The dialed digits remain in 
the register until a completing marker (one of possibly several) removes 
them and uses them to complete the call. The calls, or requests for con- 
nection, may be thought of as arriving from the left, and proceeding 
through the diagram from left to right. There may be a delay in obtain- 
ing dial tone, a delay in securing the services of a completing marker, or 
a circuit-busy delay (or rejection) in the network. It should be observed 
that the switching equipment necessary for connecting subscribers to 
registers, or registers to completing markers, is left out of account in this 
model. 

A second example is obtained from the first by inserting a buffer 
memory between the registers and the markers as shown in Fig. 5. (One 


COMPLETING 
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REQUESTS DIAL TONE BUFFER TO 
FOR pie ae —» CONNECTING 
SERVICE NETWORK 


J 
1 
I 


CALL INFORMATION MOVED TO BUFFER 
MEMORY AS SOON AS POSSIBLE, 
TO FREE REGISTER 


Fig. 5 — Control unit with buffer memory. 
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can argue that registers are expensive special-purpose units and should 
not be used for storing call information when cheap memory is available.) 
When dialing is finished, the call information is forthwith transferred to 
the buffer memory, there to wait for a completing marker without pre- 
empting a register. The markers and registers are now effectively iso- 
lated, so that delays in completing calls do not cause delays in obtaining 
dial tone. Again, traffic is viewed as moving from left to right. 

The high speeds possible with electronic circuits have Ied to new con- 
figurations and problems (for control units and networks) which have 
not yet received much attention in congestion theory. Although it per- 
forms the same functions, the control unit of a modern electronic central 
office usually has an organization differing from that of the examples of 
Figs. 4 and 5, which are characteristic of electromechanical systems. Four 
principal reasons for this contrast are: 

(z) The electronic office relies heavily on a large digital memory to aid 
in processing calls and (in time division systems) to keep track of calls 
in progress; electromechanical systems, on the other hand, are based 
largely on ‘‘wired-in”’ memory. 

(ii) In the electronic office, processing a given call usually requires 
several consultations of the digital memory; thus, the flow of traffic in 
the control unit is re-entrant and not unidirectional as in Tigs. 4 and 5. 

(it) The speed of electronic components often makes it possible to 
perform only one operation at a time; thus, a single unit may be (alter- 
nately) part of a dial-tone marker, part of a register, part of a completing 
marker, etc., depending on the details of organization of the control unit. 

(iv) The replacement of “wired-in’” memory, whose stored information 
is immediately available, by an electronic memory which has to be con- 
sulted, creates problems analogous to the problem of connecting complet- 
ing markers to registers in the No. 5 crossbar system: special access units 
are needed. Subunits of the control unit, such as dial-tone markers, com- 
pleting markers, senders, etc., must take turns in using the access circuit 
to the digital memory. 

Fig. 6 depicts a (hypothetical) control unit for an electronic switching 
system built entirely around a memory which stores all information on 
the current status of calls. The control unit consists of various special- 
purpose units such as a sender, a receiver, a completing marker, a dial- 
tone marker, and registers. Each of the listed units can operate inde- 
pendently of and simultaneously with the others; however, they compete 
for (take turns at, possibly with priorities) the access circuit to the 
memory. Each unit depends on the memory to give it a new assignment, 
to file the results of the last one, or both. Every operation of a special- 
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Fig. 6 — Block diagram of electronic control unit. 


purpose unit requires access to the memory, either to obtain data from it, 
or to file data in it, or both. The memory contains several classes of calls: 
those waiting for dial tone, those waiting for a completing marker, those 
actually in progress in the connecting network, etc. 


VI. PERFORMANCE OF SWITCHING SYSTEMS 


In general, the gross or average features of switching systems are both 
more accurately predictable and more economically important than the 
specific details. The average load carried by a trunk group is usually more 
easily predicted than the condition of a particular trunk; and the “all 
trunks busy” condition of the group is of greater concern to the telephone 
administration than the busy condition of a single trunk. 

From the point of view of economics and traffic engineering, only cer- 
tain average features of the behavior of a system (used as measures of 
performance) are important. These few quantities of interest depend on 
the multitude of details of ‘fine structure” in the control unit and the 
connecting network. Although the intricate details give rise to the im- 
portant averages, the details themselves are of relatively little interest. 
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In the rest of this paper, we shall repeatedly contrast the few average 
quantities that are of engineering interest with the many millions of de- 
tailed features and properties (of connecting systems) on which the av- 
erages are based.:The central problem in the theory of connecting systems 
is to understand how the interesting quantities arise from the details, 
and to calculate them. 

We shall start our discussion of the contrasting roles of averaged fea- 
tures and details by considering some of the different kinds of congestion 
that interest engineers, and in addition some associated measures for the 
performance of systems. 

Congestion is said to occur in a connecting system when a requested 
connection cannot be completed immediately. By “‘immediately’’ we 
mean, of course, not “instantaneously’’, but “as fast as control equip- 
ment, assumed available, can do its work”. The time it takes to complete 
a call contributes to congestion only if it keeps other calls from being 
completed at the normal rate. That a call cannot be completed imme- 
diately (in this sense) may be due to facts of three kinds: (7) certain nec- 
essary units of switching equipment (like trunks, or markers) are all 
busy; (7) there are available units, but they occur in an unusable com- 
bination, or “fail to match’’; (272) congestion has occurred previously, 
and other requests are awaiting completion. 

In telephone traffic theory, requests for connections which encounter 
congestion are traditionally termed lost calls. This terminology is used 
whether the request is refused (and never completed), or merely delayed 
(and completed later). Switching systems differ in the disposition of lost 
calls, i.e., in what is done with requests which encounter congestion. 
There are in theory two principal ways of disposing of lost calls. In the 
first way, termed “‘lost calls cleared”’, the request is denied and leaves the 
system; this way of dealing with lost calls naturally gives rise to the 
proportion of requests dented, or the probability of blocking or loss, as a 
measure of performance. The second way of disposing of lost calls is 
termed “lost calls delayed”’, and consists in delaying the request until 
equipment becomes available for completing the connection; associated 
with this is the probability of delay in excess of a specified time ¢, as a 
measure of performance. 

On the simplified account of the last paragraph we must impose at 
least two qualifications. First, whether a request suffers blocking or de- 
lay (or both!) may depend on the condition of the system at times shortly 
after the request is made; second, the completion of a request usually in- 
volves a sequence of steps, any one of which may expose the request to 
delay or loss. For example, a request may encounter delay in obtaining 
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dial tone, delay in securing the services of a completing marker, and 
delay or blocking in the attempted completion of the desired connection 
through the connecting network. 

We conclude this section by briefly considering what general features 
of connecting systems are particularly relevant to their performance as 
measured (for example) by probabilities of blocking or delay, or by av- 
erage loads carried, offered, or both. Now, a connecting system has two 
principal parts, the control unit and the connecting network; the features 
of the system that are relevant to performance are conveniently distin- 
guished according to whether they are features of the control or of the 
network. This distinction is fundamental because the performance of the 
control is largely determined by the speed and number of the various 
sub-units comprising it, while the performance of the network is largely 
dependent on what combinations of calls can be in progress simultane- 
ously. 

The control unit is basically a data processing system: it collects in- 
formation about desired connections, digests it, makes routing decisions, 
and issues orders for completing requested calls in the connecting net- 
work. Its capacity is measured, e.g., by the number of customers who 
can be dialing simultaneously, or by the number of calls which are being 
completed in the network at the same time. Its performance is described 
by the probability distributions of delay before receiving dial tone, and 
of delay after completion of dialing until the desired connection is com- 
pleted. 

For a simple model of a control unit (such as depicted in Fig. 4), the 
features pertinent to performance are: (7) the calling rate, (77) the num- 
ber of registers for dialing, and (zz) the speed and number of completing 
markers. In the case of the prototype electronic control unit (depicted in 
Fig. 6) some additional features appear: (7v) the speed of the access cir- 
cuit to the memory, (v) the order of priority of the functions being per- 
formed, the discipline of access to various services, and the competition 
for access among marker, dial tone marker, sender, ete., (v7) the presence 
of re-entrant traffic (every call must “use” the access circuit at least 
twice), and (viz) the number and arrangement of the various functions 
which are going on simultaneously. 

The connecting network, in contrast to the control unit, determines 
what calls can be in progress, rather than how fast they can be put up. 
Its configuration determines what combinations of terminals can be con- 
nected simultaneously together. For example, if m 2 n, the Clos network 
of Fig. 3 has the property of rearrangeability: any preassigned set of calls 
can be simultaneously connected. The No. 5 network of Fig. 2 does not 
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have this property: the number of calls between a line link frame and a 
trunk link frame is limited by the number of links between those two 
frames. Such combinatory properties of the structure of the connecting 
network play a determining role in estimating the cost and the per- 
formance (probability of blocking) of the network. If the structure is too 
simple, very few calls can be in progress at a given time and blocking is 
high; if it is extensive and complex, it may indeed provide for many large 
groups of simultaneous calls in progress, and so a low probability of 
blocking, but the network itself may be prohibitively expensive to build 
and to control. 


VII. DESIDERATA 


Our discussion of the three prominent features of switching systems —- 
(z) great complexity, (72) definite structure, and (2727) randomness — has 
exposed or suggested some of the problems and desiderata which a theory 
of congestion in large-scale systems must (respectively) encounter and 
supply. Specific statements of requirements and tasks are now given. 

General desiderata can be obtained by examining the purpose served 
by a theory of congestion. The function of such a theory is twofold: it is 
(i) to describe the operation of switching systems, and (72) to predict the 
performance of systems. More specifically, the descriptive function (¢) is 
to provide a theoretical framework into which any system can be fitted, 
and which permits one to evaluate the performance of the system, e.g., 
to compute the chance of loss, to estimate a sampling error, or to prove 
a network nonblocking. The predictive function (77) has logically the 
same structure as (7), but emphasizes the use of theory to make future 
capital out of past experience, to extrapolate behavior and thus to guide 
engineering practice. 

More specific tasks than these appear when we list some of the activi- 
ties comprised by the theory and practice of traffic engineering. A possi- 
ble list is as follows: 

i. Describing and analyzing mathematical models. 
ai. Computing measures of performance for specific models. 
tat. Studying the accuracy of traffic measurements, the effects of 
transients, and problems explicitly involving random behavior in 
time. 

iv. Comparing networks, control systems, methods of routing, etc. 

v. Using traffic data to verify empirically the assumptions of theories. 
vt. Making predictions and estimates for engineering use. 

On the basis of this list, and of our previous discussions of complexity, 
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randomness, gross features, and details, we can say that a satisfactory 
theory of congestion must meet the following requirements: 

z. It must be sufficiently general to apply to any system. 

a. It must yield computational procedures for system evaluation and 
prediction of performance, based on masses of detail. These pro- 
cedures must be at once feasible and sufficiently accurate, and if 
approximations are made, their effect must be analyzable. 

ait. It must encompass all the three basic elements simultaneously, 
viz., the random traffic, the control unit, and the connecting net- 
work. 


Vil. MATHEMATICAL MODELS 


We shall now consider what mathematical structures are appropriate 
theoretical descriptions of operating connecting systems. The discussion 
will provide an intuitive picture of an operating system, and will help to 
motivate a natural division of our subject into combinatory, probabilistic, 
and variational problems. 

By a state we shall mean a partial or complete description of the condi- 
tion (of the system under study) in point of (2) busy or idle network links, 
crosspoints, and terminals, and (72) idle or busy control units or parts 
thereof. Complete, highly detailed descriptions correspond to fine-grained 
states specified by the condition of every crosspoint, link, or other unit 
in the system, in absolute detail. Incomplete descriptions correspond to 
coarse-grained states, or to equivalence classes of fine-grained states. 

During operation, the connecting system can pass through any per- 
mitted sequence of its states. Each time a new call arises, or some phase 
of the processing of a call by the control unit is finished, or a call ends, 
the system changes its fine-grained state. These changes do not usually 
occur at predetermined epochs of time, nor in any prescribed sequence; 
they take place more or less at random. At any particular time, it is 
likely that some terminals, links, and parts of the control unit are idle, 
that various requested calls are being processed, and that certain calls 
are in progress in the connecting network. 

The last paragraph suggests the following intuitive account of an op- 
erating switching system: it is a kind of dynamical system that describes 
a random trajectory in a set of states. Such an intuitive notion can be 
made mathematically precise in many ways. Any one precise version is 
a mathematical model for the operation of the switching system. In con- 
structing such a model, it is neither necessary nor desirable always to use 
the most detailed (the fine-grained, or microscopic) states; often a partial 
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description in terms of coarse-grained states suffices, and is less difficult 
to study. Indeed, in building a model it is to some extent possible to 
choose the set of states to suit special purposes. One can, for instance, 
control the amount of information included in the state so as to strike 
a balance between excessive detail and insufficient attention to relevant 
factors. It is possible to make the notion of state more or less complete 
so as to achieve certain (desired) mathematical properties (such as the 
Markov property, or a suitable combinatory structure) which simplify 
the analysis of the random trajectory. Finally, one can add supplemen- 
tary variables analogous to counter readings or cumulative measurements, 
and obtain their statistical properties. 

The abstract entity appropriate for describing the random behavior of 
a switching system is a stochastic process. For our present heuristic pur- 
poses, we can define a stochastic process as follows: by a possible history 
of the system we mean a function of time taking values in the chosen set 
of states; a stochastic process is then a collection Q of possible histories 
of the system in time, with the property that many (presumably in- 
teresting) subsets A of Q have numerical probabilities Pr{A} associated 
with them. The probability Pr{A} of the set A of possible histories is 
interpreted as the chance or likelihood that the actual history of the 
system be one of the histories from the set A. Models of this kind furnish 
information because desired quantities can be calculated from the basic 
probabilities Pr{A}. 


IX. FUNDAMENTAL DIFFICULTIES AND QUESTIONS 


The systematic use of mathematical models (such as stochastic proc- 
esses) in congestion theory and engineering has been largely limited to 
small pieces of systems like single-server queues, groups of trunks with 
full access, etc. More complex models of systems involving connecting 
networks have hardly been touched by theory. This limitation has been 
due almost entirely to the large number of states such models require, 
and to the complex structure of the transitions (changes of state) that 
can occur. In short, the essential characteristics (of large-scale connect- 
ing systems) themselves generate the basic difficulties of the theory. 

In most congestion problems, it is easy enough to construct (say) a 
Markov process that is a probabilistic model of the system of interest. 
But it is difficult, because of the large number of states and the complex- 
ity of the structure, to obtain either analytic results or fast, reliable 
simulation procedures. This circumstance has been a major obstacle to 
progress in the congestion theory of large systems. One of its conse- 
quences has been that in some cases, models known to be poor repre- 
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sentations of systems have been used merely because they were mathe- 
matically amenable, and no other tractable models were available. Even 
overlooking such extremes, it is fair to state that, to date, problems of 
analysis and computation have limited the amount of detail embodied 
in the notion of state for models of switching systems. Every effort has 
been made to keep the number of states in models small, and their com- 
plexity low. 

Having exposed some basic properties of and theoretical problems 
arising from congestion in connecting systems, let us acknowledge that 
an operating, large-scale connecting system cannot be done full theoreti- 
cal justice except by a stochastic model with an astronomical number of 
states and a very complicated structure of possible transitions. At this 
point, let us try to take a synoptic view of the subject, and ask some 
general questions whose discussion might indicate new approaches and 
emphases. Let us, in the current idiom, lean back in our chairs, make a 
(n) (agonizing?) reappraisal, and draw ourselves the “big picture.’’* 

The following three questions seem (to this writer) to be pertinent, 
and are taken up in the next sections: 

i. What is the value of mathematical models that have a very detailed 
notion of state? 

it. Is it possible to make explicit theoretical use of the very properties 
of connecting systems that appear to be most troublesome? How can the 
two principal difficulties (large number of states, complex structure of 
changes) be turned into positive advantages? 

wit. What features of connecting systems are especially relevant to the 
mathematical analysis of system operation? 

We do not pretend to provide iron-clad answers to these questions. 
We try to give a helpful discussion of relevant matters, illustrated by 
examples. 


xX. THE MERITS OF MICROSCOPIC STATES 


We have raised the question: To what extent can detailed probabilistic 
models of the minutiae of operating switching systems (i.e., models with 
‘“‘microscopic” states) improve our understanding of these systems, and 
so our ability to engineer them? Against the value of such detailed models 
it can be argued that for engineering purposes only certain performance 
data are of interest, and that the detailed model produces a vast amount 
of information with no apparent practical method for reducing this in- 
formation to probabilities of delay or blocking. 


* Supplying those clichés whose substitution leaves the content of this last 
sentence invariant is left as an exercise for the reader. 
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Since the usefulness of mathematical models depends entirely on the 
desired information they can be forced to yield, it is not reasonable to 
dismiss detailed models a priorz. For in truth, few if any such models 
have been considered, and it has not been shown that they are useless 
in the sense that no practical method for extracting useful quantities from 
these models exists. 

To be sure, the congestion engineer is not as concerned with the min- 
utiae themselves as with their effect en masse. But he has to base his 
conclusions and recommendations in some way on the total effects of a 
large number of individually trivial events. Hence, at some point in his 
procedure, he must take account of the large number of states and the 
complex structure of possible transitions of his system. 

Traffic engineering practice is based on (relatively few) probabilities 
and averages, such as average loads, deviations about them, and blocking 
or delay probabilities. Any reliable theoretical estimate of these averages 
must be based on the combinatory and probabilistic properties of a 
theoretical model (stochastic process) for system operation. At worst, 
an approach or model that provides detailed information might yield a 
much-needed check point for the methods that are in current engineering 
use, and so increase the engineer’s understanding of and confidence in 
these methods. 

However, there is a much more general, positive sense in which atten- 
tion to the details of connecting systems can contribute to theoretical 
progress. This is taken up in the next section. 


XI. FROM DETAILS TO STRUCTURE 


The prospect of solving (say) statistical equilibrium equations for 
models with a very detailed notion of state is discouraging indeed, al- 
though it has been faced, notably by Elldin?® in Sweden. Nevertheless, a 
sanguine and useful approach (along this line) to connecting systems can 
be obtained by a shift of emphasis from ‘details’ to “structure.”? We 
have emphasized that describing an operating connecting system means 
keeping track of numerous details, none of which is interesting in itself. 
We have said that the operation of such a system could be pictured as a 
trajectory in a very complicated set of states. We now claim that the 
inclusion of enough details (in the notion of state for a model) gives the 
set of possible states a definite structure that 1s useful because it makes 
possible or simplifies the analysis of the probabilistic model. 

Whatever may be the value of detailed probabilistic knowledge for the 
immediate problems of engineering, such knowledge is useful if not essen- 
tial in theoretical studies. By using a highly detailed, “microscopic” 
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description for the state of the system, it is possible to exploit the exten- 
sive mathematical structure (properties) that such a set of states nat- 
urally has. Indeed, the combinatory properties and geometrical structure 
of the set of states are two of the very few weapons available for attacking 
large-scale problems of traffic theory. I believe that in the past these 
properties and this structure have not been sufficiently exploited. They 
can only be put to use by a systematic application of ‘‘microscopic”’ 
states. 

The three basic properties of switching systems discussed in Section 
V were (7) extreme combinatory complexity, (22) definite geometrical 
structure, and (777) randomness. The preceding paragraphs of this section 
can be related systematically to these properties, and elaborated into a 
sort of program: Instead of throwing up our hands at (7) in trying to do 
justice to (272), we should realize that a detailed notion of state allows us 
to turn (72) to our advantage in studying (277). Let us then disregard the 
fact that there are many states, and analyze the structure of possible 
changes of state, to see how to capitalize on it. 

For, indeed, the possible microscopic states of a particular connecting 
system are not arbitrary. They are rigidly determined by the combina- 
tory and topological properties of the connecting network, and by the 
organization of the control unit. Such a set of possible states has a mathe- 
matical structure of its own, and this structure is relevant to the per- 
formance of the system, and to any stochastic process that represents its 
operation. 

It can be seen quite generally that when a switching system changes 
its microscopic state, it can only go to a new state chosen from among a 
few “neighbors” of the state it is leaving. These neighbors comprise the 
states which can be reached from the given state by starting a new call, 
ending an existing call, or completing some operation in the control unit. 
In a large system, a state may have many such neighbors, but they will 
be few in comparison with the total number of microscopic states. 

A striking and useful example of how details give rise to structure can 
be obtained by considering the possible states of a connecting network. 
These states can be arranged in a pattern as follows: At the bottom of the 
pattern we put the zero or ground state in which no calls are in progress; 
above this state, in a horizontal row, we place all the states which consist 
of exactly one call; continuing in this way, we stack up level after level 
of states, the kth level Li consisting of all the states with & calls in 
progress. 

We now construct a graph by drawing lines between states that differ 
from each other by exactly one call. (Such states, needless to say, are 
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always in successive levels of our diagram.) This graph we call the state- 
diagram. It is a natural (and standard) representation of the partial or- 
dering < of the states: where x and y are states, 


esy 


means that y can be obtained from x by adding zero or more calls to 2, 
or alternately, that x can be got from y by removing zero or more calls. 
The importance of this state-diagram lies in two facts: 

1. The state diagram gives a geometrical representation of the possible 
states of the system. The myriad choking ‘details’? of the connecting 
network have been converted into a vast geometrical structure with 
special properties. The operating system describes a trajectory through 
the state diagram, moving between levels as calls begin and end. 

a. Any stochastic process describing the operation of the connecting 
network is a point moving randomly on the state diagram. The motion 
is only between adjacent levels. New calls put into the network cor- 
respond to jumps to the next higher level; hangups correspond to jumps 
to the next lower level. 

As a simple example, we consider the possible states of a single 2 by 2 
switch. These consist of (z) the zero state, (7) the four ways of having 
one call up, and (277) the two ways of having two calls up. These states 
are depicted in Fig. 7. Fig. 8 shows the states of a 2 by 3 switch. 


XII. THE RELEVANCE OF COMBINATORY AND STRUCTURAL PROPERTIES: 
EXAMPLES 


In this section we elaborate, by discussing examples, our theme that 
the combinatory and structural properties of connecting systems are of 


tf oft 


To kt 
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Fig. 7 — States of a 2 by 2 switch. 
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Fig. 8 — States of a 2 by 3 switch. 


the greatest import (z) to their performance, and (77) to the analysis of 
mathematical models of their operation. The organization of the control 
unit and the configuration of the connecting network largely determine 
the possible microscopic states of the system. Let us see what effects 
these features can have on problems of system analysis. 

Example 1: Any connecting system has a “zero” or ground state in 
which all terminals and links are idle, no calls are being processed by 
the control unit, and the connecting network is empty. The existence of 
this zero state is a structural property common to all switching systems. 
This zero state seems most uninteresting. Nevertheless, many probabilis- 
tic models (for switching system operation) have the property that if 
the equilibrium probability of the zero state is known, then that of any 
other state can be determined in a simple way. Several specific examples 
of this phenomenon are worked out later in this paper, so none will be 
given here. (See Sections XV and XVI.) 

Example 2: The relevance of combinatory properties of the connecting 
network to the calculation of probabilities can be vividly illustrated by 
reference to Clos’ work on nonblocking networks (see Ref. 29). The 
blocking probability of a connecting network is the fraction of attempted 
calls that cannot be completed because no path for the call exists in 
the current state of the network. Until Clos’ article appeared it was 
not generally known that, no matter what probabilistic model was used, 
an exact calculation of blocking probability for a Clos network with 
m = 2n — 1 (see Fig. 3) would yield the value zero!* 


* Zero, not zero factorial, which equals unity! 
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Example 3: Consider the class of connecting networks which have the 
property that in any state of the network, two idle terminals (forming 
an inlet-outlet pair) can be connected in at most one way. For each 
member of this class of networks we construct a Markov stochastic 
process to represent its operation under random traffic, as follows: in 
any state, if an inlet-outlet pair is idle, the conditional probability 
is \h + o(h) that it request connection in the next interval h, as h — 0; 
also, an existing call terminates in the next interval h with a probability 
h + o(h), as h — 0; requests that encounter blocking are denied, and 
do not change the state of the system (lost calls cleared). 

If X is a finite set, let | X | be its cardinality, i.c., the number of 
elements of X, and let S be the set of all states of the network under 
discussion. For x in S, define 


A, = set of states accessible from x by adding a eall 

B, = set of states accessible from x by removing a call 
| a | = number of calls in progress in state x 

L;, = set of states with & calls in progress. 


Note that | B,| = |2|. 

Let p, be the stationary or equilibrium probability that the system 
is in state x. By reference to I*ig. 9, it can be seen that the statistical 
equilibrium equations for our probabilistic model are 


yeAy eBy 


Since in any state an idle pair can be connected in at most one way, 
no routing decisions need to be made, and the solution of this equation 


Liz} 


Lyxy-1 





Fig. 9 — A state xv, and the sets A, , B, in the state diagram. 
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(regardless of the network configuration!) is given by 


Pc = Pod i | : x0 
pot=1+ DA"! 
yeS 
y>0 
= EN | Le | 
K=O 


where 0 is the zero state. We have therefore shown that the simple 
combinatory property, that a call can be put up in at most one way, 
implies that the stationary probabilities of the Markov process we de- 
fined are of a simple geometric type. Note the important role played 
by the zero state, as discussed in Example 1. 

Example 4: The Markov stochastic processes of the previous example 
can be used to illustrate another important point. There are many 
switching system models for which quantities of interest (such as the 
probability of blocking) can be given rigorously, without approximations, 
by a formula in which the distinction between system combinatorics 
and random customer behavior appears explicitly. In Example 3, the 
state probabilities {p,, « « S} are completely determined by the quan- 
tities 

| Le |, k20 


i.e., by the number of states with k calls in progress, for k 2 0. For 
these models we can express the blocking probability as a function of 
the traffic parameter \ and of | LZ, |, k 2 0. The numbers | L, | repre- 
sent purely combinatory properties of the network. 

The blocking probability b can be calculated as follows: 6 is the frac- 
tion of attempted calls that are unsuccessful, so that 


a oe total rate of successful attempts 
total rate of attempts 





In equilibrium, the total rate of successful attempts must equal the 
total rate of hang ups. The total rate of hang ups is 


> pz |x| = mean number of calls in progress 
xeS 


(because the mean holding time is used as the unit of time). Let N 
be the number of terminals offering traffic. Since an idle inlet-outlet 
pair calls at a rate A, the attempt rate in a state x is 


- (number of idle pairs in a state x) = X (" te ) , 
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The total rate of attempts is then 
N—2|2 
a 2!) 
zeS 


Hence, 


Dz | 2 | 
zeS 


pr. ae eso le 
zeS 
[N/2] 
> Nk | Le | 
= k>0 


[N/2] ’ 
— 
> ron (% ; ‘ 
k>0 


where [N /2] is the greatest integer less than or equal to N/2. This for- 
mula exhibits the blocking probability as a rational function of the 
calling rate \ per idle pair and as a bilinear function of the combinatory 
constants {| LZ; |, k 2 O}. The degree of the denominator in d is one 
more than that of the numerator, sob — 1 as\ — ©; also, note that 


fim keel 
»>0 N 
(¢) 


This limit is greater than zero if there are calls which cannot be put up 
in any way. Finally, we observe that if the network is non-blocking, then 


vanes eee) 





zreLp_1 
— 2k +2 
and so b = 0, as it should, if we interpret 
C — alia 
2 


as zero. 


XIII. COMBINATORY, PROBABILISTIC, AND VARIATIONAL PROBLEMS 


The preceding discussions have established that the ingredients going 
into a mathematical model of a connecting system are of two kinds. 
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On one hand are the combinatory and structural properties, and on the 
other, the probabilistic features of traffic. We emphasize the distinction 
between these aspects, and claim that by carefully drawing it, we can 
extend the general understanding of connecting systems, unify or 
modify existing theoretical methods, and obtain new engineering results. 

Our discussion also suggests that to study stochastic processes that 
represent operating connecting systems, it is essential to have an ex- 
tensive theory of the combinatory and topological nature of the micro- 
scopic states of such systems. 

In any specific model of a connecting system, one can distinguish the 
combinatory from the stochastic features. However, it is also of interest 
to compare models of systems in an effort to determine optimal systems. 
These facts suggest a useful though imprecise division of the entire 
subject (of connecting system models) into three broad classes of prob- 
lems. In order of priority, these are 

z. Combinatory problems. 

wt. Probabilistic problems. 

au. Variational problems. 

This order of priority arises in a natural way: one needs to study com- 
binatory problems in order to calculate probabilities; one needs both 
combinatory and stochastic information in order to design optimal 
systems. 

The tripartite division just made provides a rational basis for organiz- 
ing research effort. Since so many of our pronouncements have been 
generalities, we devote the remainder of the paper to illustrating care- 
fully each of the three divisions (combinatory, probabilistic, variational) 
by working out and discussing in detail a very simple (yes, a trivial) 
problem from each division. These problems have been chosen for their 
tutorial value rather than their realism or usefulness. In discussing them, 
we place emphasis on furthering insight rather than solving practical 
problems, on exposing principles rather than providing engineering data. 


XIV. A PACKING PROBLEM 


It has long been suspected (and in some cases, verified experimen- 
tally) that routing calls through a connecting network “in the right 
way”’ can yield considerable improvements in performance. This pro- 
cedure of routing the calls through the network is called “packing” 
(the calls), and the method used to choose routes is called a ‘packing 
rule.”’? The use of the word “‘packing”’ in this context was surely sug- 
gested by an analogy with packing objects in a container. However, 
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the existence and description of packing rules that demonstrably im- 
prove performance (e.g., by minimizing the chance of. eae) are 
topics about which very little is known. 

What, then, is the “right way” to route nila? It he neat argued 
heuristically that it is better to route a call through the most heavily 
loaded part of the network that will still take the call. Appealing and 
simple as this rule is, nothing is known about it. We know of no pub- 
lished proof of either its optimality or its preferability over some other 
rule. The rule will be proven optimal for an example in Section XVI. 

The question naturally arises, though, whether for a given network 
in which blocking can occur there exists a packing rule so cunning that 
by following it all blocking is avoided. Then, use of the rule makes the 
network nonblocking. Such a network may be termed nonblocking in 
the wide sense, while a network none of whose states has any blocked 
calls may be termed nonblocking in the strict sense. 

The existence of such a rule is a purely combinatory property of the 
network, and so serves as an example of the first type of problem de- 
scribed in Section XIII. Unfortunately, practically useful connecting 
networks that are nonblocking in the wide sense are yet to be found. 
Since we are primarily interested in exemplifying principles, we shall be 
content with discussing an impractical network that is nonblocking 
in the wide sense. The example to be given was suggested by E. F. 
Moore.* 

Let us first consider the three-stage connecting network depicted in 
Fig. 10. All switches in the middle column are 2 by 2, and there are 2n — 1 
of them, so, by a result of C. Clos,” the network is nonblocking. Suppose 
that we use the rule that an empty middle switch is not to be used unless 
there is no partially filled middle switch that will take the call. In other 
words, do not use a fresh middle switch unless you have to! In general, 
this rule is not quite the same as the one exhorting use of the heavily 
loaded switches wherever possible, because it only tells us what to 
avoid, but it is in the same spirit. In the case to be considered, however, 
a middle switch is either empty, half-full, or full; hence the two rules 
coincide. 

We shall show that if this rule is used, then no more than [3n/2] middle 
switches are ever used, where [1] is the greatest integer less than or equal 
to x. Thus the rest, about one quarter of the middle switches, could be 
removed and no blocking would result if the rule were used. It can be 
verified by examples that if there are only [87/2] middle switches and 
the rule is violated, then calls can be blocked. Thus, the network of 


* Private communication. 
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2x2 





Fig. 10 — Three-stage nonblocking connecting network (Clos type). 


Tig. 11 is not nonblocking in the strict sense, but is nonblocking in the 
wide sense. 

A state x of a connecting network is called reachable (under a rule p) 
if using the rule p to make routing decisions does not prevent the system 
from reaching z from the zero state. We set 


S(x) = number of middle switches in use in state x. 


Let us use the diagram of Fig. 12 as a canonical representation for a 
2 by 2 middle switch. The numbers at the left [top] indicate to which 
outer switch on the left [right] the numbered link connects. The seven 
possible states of a middle switch are depicted in Vig. 13, and are in- 
dexed therein by letters a,b, ---, g. A state x may then be represented 
(to within renaming switches and terminals) by giving seven integers 
a(x), b(@), --- , g() where 


a(x) = number of middle switches of type a when network is 
in state x 


2x2 





[3n/2] 


Fig. 11 — Three-stage network which is nonblocking if proper routing is used. 
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CROSSPOINT 


Fig. 12 — Representation of a 2 by 2 middle switch. 


g(x) = number of middle switches of type g when network is 
in state 2. 


It is clear that for any state x 


a(x) + b(x) + -+- + g(x) = 2n- 1 
b(x) + e(x) + +++ + g(x) = S(a). 
MIDDLE SWITCH STATE TYPE CALLS 

1 2 
1 
bs a NONE 

| | b (1,1) 

| | c (2,2) 

| | d (2,1) 

| | e (1, 2) 
ae f (1, 1)(2,2) 
ain g (2,1)(1,2) 


+ = CLOSED CROSSPOINT 


Fig. 18 — Seven possible states of a middle switch. 
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Theorem 1: Let p denote the rule: Do not use an empty middle switch 
unless necessary. Let x be a state of the network of Fig. 10. Let x be reach- 
able under p. Then forn 2 2 


S(x) < [Bn/2] (1) 
b(x) + c(a) + f(x) Sn} 2) 
d(e) + e(e) + g(a) S nf : 


Proof: Each reachable state is reachable in a certain minimum number 
of steps. The thevrem is true if x consists of one call and is reachable 
from the zero state in one step. As an hypothesis of induction, assume 
that the theorem is true for all states reachable in k steps or fewer. 
All changes in the state are either hangups, or new calls of the following 
kinds: 


Type 1: 
aty) > a(y) — 1 
(1,1) by) > by) +1. with c(y) 
(2,2) ey) ae(y) +1 with by) 
(2,1) dy) >diy) +1 with e(y) 
(1,2) ey) >ey) +1 with d(y) 
Type 2: (preferred by p) 


I 
SS So SS 


a(y) remains fixed and 
(,1) fw -fy +1, ey cy) —1 with cy) >0 
(2,2) fy -~fy +1, by) by) -—1 with by) >0 
(2,1) gy —>9y +1, ey) >ey) - 1 with ey) > 0 
1,2) gy mg +1, dy)—dy) -—1 with dy) > 0. 
All states, reachable or not, satisfy the inequalities 

by) + ey) +fy) +9) Sn 

cy) + dy) + fy) +9y) Sn 

by) + dy) + fy) +9y) Sn 

c(y) + ey) +f) + gy) Sn. 


The alternative preferred by p changes neither the value of S(-) 
nor the truth of (2) of the theorem. Consider a state x first reachable in 
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k + 1 steps. If x is first reachable by a hangup or by putting up a call 
of Type 2, then (1) and (2) are true of z. Suppose then that 2 is first 
reachable in & + 1 steps only by putting up a call of Type 1. Without 
loss of generality we can consider only the case where the new call is a 
(1, 1) call; the other three cases are symmetric. Let y be a state from 
which 2 is thus first reachable. Since the avoided alternative is used, 
we have 


c(y) = 0. 

Since a (1, 1) call is possible in state y, we must have 

by) + dy) + fy) +9) Sn-1 

by) + ey) +fy) +9y) Sn-1 
and from the induction hypothesis 

dy) + ey) + gy) Sn. 
Hence, 
2{b(y) + dy) + ey) + fly) + gy)} S 38n — 2 


or, since c(y) = 0 
S(y) s ” —1. 


However, S(x) = S(y) + 1, so S(x) S [8/2]. To show that (2) also 
holds of x consider that 


b(y) + ey) + fy) + gy) 
c(y) 


n— 1 


0. 


IA 


It follows that 
by) +ey +fy sn— 1. 


However, since x is obtained from y by putting up a (1, 1) call of Type 
1, we have 


bz) = by) +1, — e(x) = ely) 
c(x) = cy) = 0, f(z) = fy) 
d(x) = d(y), g(x) = g(y). 


Hence, (2) of Theorem 1 is true of x. This proves the result. 
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XV. A PROBLEM OF TRAFFIC CIRCULATION IN A TELEPHONE EXCHANGE 


We shall describe and analyze a simple stochastic model for the 
operation of the control unit of a switching system. The connecting 
network is assumed to be nonblocking and is left out of account. 

To set up a telephone call in a modern electromechanical automatic 
exchange usually involves a sequence of steps which are (traditionally 
and functionally) divided into two groups. The first group consists in 
collecting in a register the dialed digits of the called terminal. The 
second group, performed by a machine called a marker, consists in 
actually finding a path through the connecting network for the desired 
call, or otherwise disposing of the request for service. For even if a path 
to the called terminal be found, this terminal may already be busy. 

In the exchange, enough registers and markers must be provided to 
give customers a prescribed grade of service. For engineering purposes, 
then, it is desirable to know the probability that r registers and m markers 
are busy. Let us assume that the exchange serves N customers, and that 
there are R registers and M markers. All calls are assumed to go to 
terminals outside the exchange. 

We may think of each customer’s line as being in one of a number of 
conditions, and moving from one condition to another. It makes no 
difference whether we ascribe these ‘‘conditions” to the line itself, or 
to a fictitious single customer if several people use the line. A given 
line may be ?dle (i.e., not in use); at some point in time it may request 
a connection, i.e., the customer picks up the receiver and starts wazting 
for dial tone; after obtaining a register he spends a certain amount of 
time dialing; he then waits for a marker to complete his call (freeing the 
register meanwhile); upon obtaining a marker, he must wait until the 
marker completes the connection; at this point he begins his conversation; 
at the end of his conversation his line becomes idle again. 

One may now ask, what is the distribution of the-N customers among 
these various conditions? Clearly, if not enough markers are provided 
there will be a tendency for the customers to collect in the ‘waiting 
for a marker’? condition; a lack of registers will make the customers 
collect in the ‘“‘waiting for dial tone”’ condition. 

To obtain a simple probabilistic model for the “‘circulation” of custo- 
mers, we assume that the probability that an idle customer starts a 
call in the next interval of time of length h is Ah + o(h), the chance that 
a dialing customer completes his dialing in the next interval h is 6h + 
o(h), the chance that a busy marker finishes the call it is working on is 
uh + o(h), and the probability that a conversation ends is h + o(h), 
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all as h — 0. The probability of more than one such event in h is o(h) 
ash — 0. 

These assumptions are in turn consequences of assuming that the 
time a customer stays idle, the time a customer takes to dial, the time 
a marker takes to complete a call, and the holding time (conversation 
length) are all mutually independent random variables, each with a 
negative exponential distribution, and the respective means A~!, 67}, 
uw! and unity. The number ) is the calling rate per idle customer, 6 
and uw are the average rates of dialing and call completion by a marker 
(respectively), and time is measured in units of mean holding time, 
so that the hangup rate per call in progress is unity. The assumption 
that the marker operation times are exponentially distributed is not 
realistic, but we make it here in the interest of obtaining a global model 
whose statistical equilibrium equations can be solved in a simple way. 
This restrictive assumption could be avoided at the cost of complicating 
the mathematics. The important features of our model are depicted in 
Vig. 14; the labeled arrows indicate the rates of motion for various 
transitions. 

The state of the system is adequately described by stating the number 
7 of idle customers, the number r of customers that are dialing or waiting 
for dial tone, the number m that are being serviced by a marker or are 
waiting for a marker, and the number c of calls in progress. Actually, 
any three of these numbers suffice, since for physically meaningful states 


ttrtm+c=N. 
Let Dirme be the equilibrium (or stationary) probability of the state 
R REGISTERS 


AL MAX (0, r-R) | 
WAITING FOR ————> | 
| 


DIAL TONE 


OMIN (r,R) 





jAMIN(m,M) MAX (0, M-M) 
———— WAITING FOR 
A MARKER 


Fig. 14 — Diagram of a telephone system. 
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(i,r,m,c). The “statistical equilibrium’? equations are, with suitable 
conventions at the boundaries. 


(At + r) min(r,/) + BK min(m,M/) + ©) Dirnie 
= (¢ + 1) pGetyrmcegty FAG H+ DDG c—-me 
+ 6min(r + 1,2) pire m—pye + w min(m + 1,M)Dirmsy(c—v- 


These equations state that the average rate at which a state is left 
equals the average rate at which it is reached from other states. We 
observe that the flow of calls in the exchange is in a sense cyclic; in 
making a call, each customer passes through four stages: idle, dialing, 
marker, conversation, then back to idle, in that order. This fact yields 
a way of solving the equations. Each side of the equilibrium equations 
has four terms, one for each of the four stages of a call. We shall find 
a way of assigning to each term on the left a corresponding equal term 
on the right which will cancel it. 

The solution of the equations for (7,r,m,c) # (N,0,0,0) is proportional 
to 


TI max (1,7/R) [J max (1, 7/1) 
NN! j<0 j=0 


ilrtmte! erm 


Sevpitnve = 


The constant of proportionality is the probability of the ‘‘zero”’ state 


—1 


Proo = (’ “i ye sae 


SG emoeo" 
i<n 

obtained from the normalization condition for probabilities. The alge- 
braic character of the solution is closely analogous to the actual pattern 
of circulating traffic in Fig. 14, for the easiest way of showing that 
firme 18S actually a solution of the statistical equilibrium equations is to 
make the following correspondence between terms on opposite sides of 
the equations: 


MDPirme ™ (¢ + 1) pci-1yrm¢e41) 
6 min(7,R)Dirme ~ AH 1) pcesaycr—tyme 
wmin(m,M) Dirme ~ 6 min(r + 1,R)picret (me 
CPirme ~ w min(m + 1,11) Dircmsty(ce—2)- 


It can be seen that each term on the left cancels the corresponding 
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one on the right when firme is substituted. Each term represents the 
(total) rate of occurrence of one of the four kinds of possible event: 
request for service, completion of dialing, completion of a call, and 
hangup. In the life history of a given call, these events occur in the 
natural cyclic order given. vents associated with corresponding (i.e., 
canceling) terms are next to each other in this cyclic order. 


XVI. AN OPTIMAL ROUTING PROBLEM 


Our final example is a variational problem involving both combina- 
toric and probability. We shall exhibit some particular answers to the 
following question: If requested connections can be put up in a con- 
necting network by several different routes, leading to different states, 
which routes should be chosen so as to minimize the probability of 
blocking? This question poses a variational problem in which many 
possible methods of operating a connecting network of given structure 
are compared, rather than one in which different network structures 
are compared. 

We shall consider this question for a connecting network that is of 
little practical significance because it is obviously wasteful of crosspoints. 
Its virtues, however, are that it is perhaps the simplest network for 
which our question can be asked, and that it clearly exhibits the prin- 
ciples and arguments involved, so that these can be understood. The 
network is shown in Fig. 15, the squares standing for square 2 by 2 
switches, 

The possible states of this network are determined by all the ways 





2x2 — 
SWITCH - 


X = CROSSPOINT 


Fig. 15 — A simple network in which optimal routing is studied. 
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in which four or fewer inlets on the left can be connected pairwise to 
as many outlets on the right, no inlet being connected to more than 
one outlet, and vice-versa. These possible states are depicted in a 
natural arrangement in lig. 16; states which differ only by permutations 
of customers or switches have been identified in order to simplify the 
diagram. That is, there is essentially only one way to put up a single 
call, there are four ways of having two calls up, two ways each of having 
three and four calls up. These “‘ways’’ have been arranged in rows 
according to the number of calls in progress, and lines have been drawn 
between states that differ from each other by only the removal or 
addition of exactly one call. 

For ease of reference, let us number the states in the (partly arbitrary) 
way indicated in Fig. 16; insofar as possible, we have used small num- 


Swkqs sqbee 
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/ \ / \ / La 
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Fig. 16 — (Reduced) state diagram for the network shown in Fig. 15. 
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bers for states with small numbers of calls. The set of possible states of 
our example then consists of (essentially) ten different configurations 
of calls in the basic network of Fig. 15. The state diagram, with each 
state identified now only by its number within a small circle, is schema- 
tized in Fig. 17. Also indicated in this schema are two important sets of 
quantities associated with the states. To the left of each state is the 
number of idle inlet-outlet pairs, and to the right of each state is the 
number of idle inlet-outlet pairs that can actually be connected, i.e., 
that are not blocked. 

Only in the state numbered 4 are there any blocked calls. It is to be 
noticed that state 4 realizes essentially the same assignment, of inlets 
to outlets as state 2, which has no blocked calls. The difference between 
the two is that in state 2 all the traffic passes through one middle switch, 
leaving the other entirely free for any call that may arise. Clearly, then, 
this difference illustrates the ‘‘packing rule’ that one should always 
put through a call using the most heavily loaded part of the network 
that will still accept the call. 

The question naturally arises, therefore, whether this packing rule is 





Fig. 17 — Schema of state diagram. 
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in any sense optimal for our particular example. We shall prove that it is, 
in two senses. It is clear from an inspection of the state diagram that 
only in state 1 is there ever a choice of route, and that this choice 
is always between states 2 and 4. From the fact that state 4 is the only 
state with any blocked calls, it is intuitively reasonable to expect that 
the probability of blocking is the least if the “bad” state 4 is avoided 
as much as possible, i.e., if from state 1 we always pass to either 2, 3, 
or 5, and visit 4 only when we have to, via a hangup from state 6. 

The next task is to choose a probabilistic model for the operating 
network; this will be done in the simplest possible way. We postulate 
that in any state of the system, the probability that a given idle inlet- 
outlet pair request connection in the next interval of time h is \h + o(h), 
the chance that an existing connection cease is h + o(h), and the chance 
that more than one event (new call or hangup) occur in h is o(h), as 
h — 0. The number ) is the calling rate per idle pair, and time is meas- 
ured in units of mean holding time, so the “hangup”’ rate is unity. New 
calls that are not blocked are instantly connected, with some specific 
choice of route, while blocked calls are lost and do not affect the state of 
the system, their terminals remaining in the idle condition. 

To complete the probabilistic description of the behavior of the sys- 
tem, it remains to specify how routes are chosen. In our example, this 
amounts to specifying whether, for certain calls arising in state 1, the 
route leading to state 2 or that leading to 4 is chosen. At first we shall 
only consider methods of choice that are independent of time, i.e., 
the choice is made in the same way each time. 

The methods of choice over which we shall take an optimum may 
be parametrized as follows: each time a choice is to be made between 
going to state 4 and state 2, a coin is tossed with a probability a of 
coming up heads. If a head comes up we choose state 4; if a tail, we 
choose state 2; the toss of the coin is independent of previous tosses and 
of the history of the system. The parameter a may take on any value 
in the interval 0 S a S 1; the value a = O corresponds to choosing 
state 2 every time; the value a = 1 corresponds to choosing state 4 
every time; a value of a intermediate between 0 and 1 means that 4 
is chosen over 2 a fraction a@ of the time. 

Introducing a natural terminology (from the theory of games), we 
may say that a choice of @ represents a policy or strategy for making 
routing decisions; a value 0 or | of a represents a pure strategy, in which 
the route is specified by a rigid rule, and there is no randomization; an 
intermediate value of a represents a mixed strategy. 
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Fig. 18 — Schema of state diagram showing transition rates. 


A choice of a determines a matrix Q = Q(a,A) of transition rates 
(lig. 18) among states of the system, and so a Markov stochastic 
process taking values on those states. As a measure of performance we 
shall use the fraction b of requests for connection that encounter blocking, 
defined as follows: let b(é) be the number of blocked calls occurring 
in the interval (0,é]; and let r(é) be the number of requests for service 
occurring in (0,¢]; then 

. b(t) 
b= lim oO: 
It can be shown that this limit exists and is constant with probability 
one, so b is well defined. 

The number 6b = D(a,\) can be calculated from the matrix Q as 
follows: if (7,4 = 0, --- , 9) is a state, let 8(¢) be the number of blocked 
idle pairs in state 7, and let y(z) be the number of calls in existence in 
state 7. The stationary state probabilities {p;,7 = 0, --- , 9} exist and 
are the unique solution of the matrix-vector equation Qp = 0. Then b is 
given by | 
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where the inner product (p,2) is >> pi; . 
t=0 


We may therefore formally state our variational problem for this 
example as follows: to find that @ in the interval 0 < a S 1 for which 
the ratio 


= minimum 





It is natural to expect that in choosing an optimum routing method 
in the example above there is no point in randomizing, i.e., using a 
mixed strategy with a unequal to either 0 or 1. That this is so is not 
obvious from our mathematical statement of the problem, and requires 
proof. We shall demonstrate a more general result: 

Theorem 2: Let x and y be vectors of 10 dimensions, with y nonnegative 
and not identically zero. 





min 
I (p, a) d 
or ? Qp = 0, i=1, OSazl 
‘ap . 24? 
max 


is always achieved for a = O0ora = 1. 
Proof: The equation Qp = 0 may be written out in the detailed form 


5 
(it) (QA + 1)pi = 16dpo + 22 Dr 


(sit) (4. + 2)pe 


4\(1 — a@)pi + pe + pz 
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(iv) (44 + 2)ps = 4dpi + pr 

(v) (2\ + 2)ps = 4rdapi + Ds 

(vi) (4A + 2)ps = Api + Ds 

(vit) (A + 3)ps = 2dp2 + 2dps + 4AD5 + 405 
(vi2t) (A + 3)pr = 2dp2 + 4rps + 4p3 

(ix) Aps = ADs 

(x) Apy = Ap7. 


These are the standard “statistical equilibrium” equations for the 
probabilistic model we have assumed. They can be solved by succes- 
sively eliminating every p; except po and obtaining a solution of the 
form 


Di =fipo, 10. 


The value of po is then determined by the normalization condition 
dino Di = Las 
it 


Po = ——z—. 
L+ Qf 

The f; are of course functions of \ and a. We shall prove that they are 
linear functions of the parameter a. 

We first eliminate p, and note that f,; = 16. Since the relations 
(izi)—(tv) contain the variables {p;, 7 = 2,3,4,5} only on the left, these 
variables may be eliminated entirely from (22), and from (viz)—(x). But 
substitution for these variables in (vzz) and (v72z) in terms of (d77)—(v7) 
introduces a and pp» only in inhomogeneous terms. Hence, fs and jf7 are 
linear in a, and so all {f;, 7 = 1, --- , 9} are linear in a. 

Clearly, we have 








because the normalization terms 1 + > 1 fz: cancel out, and so it 
follows that (p,x)/(p,y) is a bilinear function of a, i.e., it has the form 


Ai + Bye 


gla) = As Ba 


where A;, Ao, Bi, and Be are constants. Now 
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a (a) = By(Az + Boa) = Bi(Ai + Bia) 
da 9 (Ay Boa)? 
_ BiA2 — BoAi 
(Ae + Bow)? 


which is of the same sign as its numerator. Thus g (a) is either always 
nonpositive or nonnegative, and so any extremum of g(a) in0 Sa 1 
is assumed at the boundary, either for a = 0 or a = 1. Since the solution 
p of Qp = 0 is known to have all strictly positive components for all a 
in the unit interval, we have Az + Boa = (p,y) > 0. 

It follows in particular that the minimum of blocking probability b 
is achieved for a = 0 or a = 1. It is unthinkable that visiting a blocking 
state (state 4) more frequently should decrease b, so we conjecture (and 
shall shortly prove that) a should be zero rather than one. 

Before doing this though, let us observe that there is only one block- 
ing state (viz., 4), and that the blocking probability b can be written as 





2 
b BM 


1690 + 9p1 +4207: + pe + Dr 
These facts and our intuition suggest that 6 should be a monotone 
increasing function of 


_ Ps 
ak. 


This conjecture is correct, and provides an easy way of showing that 
a = 0 gives the least blocking probability. Let us prove it. 
From (z) and (27) we find that 
5 
> pi = 8A9A + 1)po — 2ZApo = 72d2po 
i=2 
whence 


eee ee 
16 + 144 + 288d? + fs + fr’ 


From (viz)—(«) we find that 


5 
Pe + Pi = —s (rir. + p77) + 4 2 | ia 2m.) 
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Therefore 


2 fs 
~ 6 + 1440 + 556d? + 192d? — 3A fa 


This is of the form 





Qe 
a— cx 





where a and c are strictly positive constants. Now 


d Qe 2 2cx 


dxra—ce a—ce' (a — cx)? 


= 24 
(a — cx)? 


IV 


0. 


Hence, b is a monotone increasing function of f;. It follows that b is a 
minimum if fs is a minimum. 

To prove that the blocking probability b is a minimum for a = 0, 
it remains to calculate ps from the equilibrium equations. By eliminating 
all the equilibrium probabilities except ps and p; , we find 


me exe — a)16dpo 4 8721 6X + 2dD5 
eae wT dn + 2 H+ 2 
AN16p. + 4dp5 , . ) 
a a oe +2 + AD7 
1 (ec — a)16\po a (16X)*Apo + 4p; 
ia Geo Ay a aD 
mt ps (me +P), 


We have purposely not simplified the terms so that their origin can be 
verified. I’rom these two equations we find that 


— Ps 
fe 
l—a 1 
A {| —— + —— 
- l-—a . a 1 Cee x ri) 
= yt Beds te SR ee ND Bae it, ce lh ee ee 
ta KPa AD+OATHR HA V43 3r 
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where 
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128° 2h + 2 d 
1 a ae 
H+3' HRS eras 3d 
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This is positive, because 
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We shall now consider the problem of optimal routing in our (trivial) 
network from a different point of view. Instead of minimizing the ratio 
of unsuccessful attempts to attempts, let us simply minimize the average 
number of unsuccessful attempts in any finite number of events, count- 
ing changes of state and unsuccessful attempts as events. 

In our example, the only choice is between states 2 and 4, when a 
particular call requests connection in state 1. By a policy, let us mean a 
function p(-) on the nonnegative integers taking the values 0 and 1. 
Let x, be the state of the network after n events, n 2 0. We say that 
the system is operated according to policy p(-) if, for each n = 0, 
given that x, = 1 and a choice occurs, the system moves to 


II 
wm S&S 


state 2. if and only if p(n) 
state 4 if and only if p(n) 


Now our intuitive feeling is that going to state 2 is preferable over 
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going to state 4 under all circumstances. At the cost of anticipating 
results to be proven, let us partially order all the possible policies by the 
definition: If p(-) and q(-) are policies, then 
p2=q ifandonlyif p(n) 2 qn) forall n= 0.* 
The shift transformation 7 of policies p(-) is defined by the condition 
Tpo(n) = p(n+1)>n2 0. 


It is evident that p = q implies Tp = Tq. Let /,,(x) = 0, and define 
number of unsuccessful attempts after n events 
L., ,»(«) = E5starting from state x if the system is operated ac- 
cording to policy p(-) 
Let S be the set of states {0,1, ---,9}. 
We shall prove 


Theorem 3: If p = q, then foralln 2 landzeS 


En ol(t) = Ey, ale). 


As a preliminary result (not without its own interest) we shall need 
the 


Lemma: For n = 1 and any policy p(-) 
Ei, p(4) = max £,, p(x). 
zeS 
This says that starting in the (sole blocking) state 4 is always the worst 
way to start, no matter how long we run the system. 


Proof: For n = 1 and x # 4, E,,,(x) = 0 since no unsuccessful at- 
tempts can occur in any state except 4. However, 


so the lemma is true for n S 1. Assume as an hypothesis of induction 
that it is true for n S k. Now for x ¥ 4, Ei41,,(%) is a convex combina- 
tion of values of EL, ,rp(-), so clearly for x 4 4 


Eiey1,p(@) S max He p(y) = Le,p(4). 
However, elementary probability arguments establish that 
Bsr p(4) = Be p(4) + Priae = 4| 0 = 4} E1,24p(4) 
so the lemma is proven. 


* Read ‘“‘p = q’’ as “‘p is better than q’’! 
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Proof of Theorem 3: For any policy s(-) 
E,,(x) =0 if «#4 


2d 
E,,.(4) _ 2+ 4d . 


Hence, 
E, p(w) = B,,.(@) forall weS. 
Assume as an hypothesis of induction that p 2 q implies 
Bin p(%) S By q(x) 
for all x and all n S k. Now for x ¥ 4 or 1 and any policy s(-), Hiss,s(2) 


is a convex combination of values of 
Ex ,rs(-). 


For x = 4, we have for any policy s(-) 


Eysis(4) = + convex combination of Ey.7;(-) 


2r 
2+ 4d 
where the coefficients of the convex combination are transition prob- 
abilities independent of the policy s(-), and 


2r : fhe event is a 


ears = blocked atienne start in state a} ' 


Hence, p = gq and x ¥ 1 implies 
Ex+1.p(X) = Epps) 
For « = 1 and any policy s(-) we have - 


4n 


Buell) = (s(1)Exro(2) + [L — 8(1))Bare(4)} 
1 + 5d 


T+ on convex combination of ELy,7;(-) 


+ 


where the coefficients of the convex combination are independent of 
s(-), and 


An Ny event requires 


I+ routing decision BUOLy State i} ; 


s 


Suppose now that p = q. It is sufficient to show that 
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P(L)Ex,rp(2) + [1 — pl) Ex, rr(4) 
= q() Li, ,rq(2) =f [1 — q(1) |x rq(4). 


If p(1) = q(1), this follows from the hypothesis of induction. The only 
other possibility is that p(1) = 1 and g(1) = 0. By the lemma and the 
hypothesis of induction we find 


Ey rp(2) S Ex rp(A4) 
= Ey ,rq(4). 


This proves Theorem 3. The result at once shows that the policy p = 1 
is optimal in the sense that it minimizes 


lim sup nL, p(x). 
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Algebraic and Topological Properties 


of Connecting Networks 


By V. E. BENES 
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A connecting network 7s an arrangement of switches and transmission 
links allowing a certain set of terminals to be connected together in various 
combinations, usually by disjoint chains (paths): e.g., a central office, toll 
center, or military communications system. Some of the basic combinatory 
properties of connecting networks are studied in the present paper. 

Three of these properties are defined informally as follows: A network is 
rearrangeable if, given any set of calls in progress and any pair of idle 
terminals, the calls can be reassigned new routes (if necessary) so as to make 
at possible to connect the idle pair. A state of a network is a blocking state 
af some pair of idle terminals cannot be connected. A network is nonblocking 
in the wide sense if by suitably choosing routes for new calls it is possible 
to avoid all the blocking states and still satisfy all demands for connection 
as they arise, without rearranging existing calls. Finally, a network is 
nonblocking in the strict sense if zt has no blocking states. 

A distance between states can be defined as the number of calls one would 
have to add or remove to change one state into the other. This distance 
defines a topology on the set of states. Also, the states can be partially or- 
dered by inclusion in a natural way. This partial ordering and its dual 
define two more topologies for the set of states. The three topologies so ob- 
tained are used to characterize (1.e., give necessary and sufficient conditions 
for the truth of) the three properties of rearrangeability, nonblocking in the 
wide sense, and nonblocking in the strict sense. Each of these three properties 
represents a degree of abundance of nonblocking states; the mathematical 
concept used to express these degrees 1s the topological notion of denseness. 
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I. INTRODUCTION 


Any large communication system contains a connecting network, an 
arrangement of switches and transmission links through which certain 
terminals can be connected together in many combinations, usually 
by many different possible routes through the network. Examples of 
connecting networks can be found in telephone central offices, toll 
centers, and military communications systems. 

The connections in progress in a connecting network usually do not 
arise In a predetermined temporal sequence; instead, requests for con- 
nection (new calls) and terminations of connection (hangups) occur 
more or less ‘‘at random.” For this reason it is customary to use the 
performance of a connecting network when subjected to random traffic 
as a figure of merit. One precise measure of this performance is the 
fraction of requested connections that cannot be completed in a given 
time interval, or the probability of blocking. In a telephone connecting 
network this probability measures to some extent the grade of service 
given to the customers. 

The performance of a connecting network for a given traffic level is 
determined largely by its configuration or structure. This configuration 
may be described by stating what terminals or transmission links have 
a switch placed between them and can be connected together by closing 
the switch. The configuration of a connecting network determines what 
groups of terminals can be connected: together simultaneously. Any one 
set of permissible connections may.be called a state of the network. 
Quantities such as the number of combinations of terminals that can 
be connected, and the number of states in which a given combination 
is connected, clearly are indicative of both the performance and the 
cost of the system. If these numbers are small the performance may be 
poor and the cost low; if large, the performance may be unnecessarily 
good and the cost prohibitive. These numbers are among the purely 
combinatory and topological properties of the connecting network. 

For example, in a telephone exchange, the network configuration 
determines what pairs of terminals can be simultaneously connected by 
disjoint paths, that is, what calls can be in progress. If this configura- 
tion is too simple, only a few pairs of terminals can have calls in progress 
between them at the same time. If the configuration is extensive and 


, 
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complicated it may provide for many large groups of simultaneous calls 
in progress, but the network itself may be expensive to build and difficult 
to control. 

To design connecting networks with confidence, then, it is desirable 
to have an adequate general understanding of their combinatory and 
topological properties. A discussion, in part heuristic and tutorial, of 
connecting systems and of some associated mathematical problems has 
been given in a paper! by the author; the reader is referred thereto 
for material suitable as background for the present paper. In that work 
a division of the topic into combinatory, probabilistic, and variational 
problems was drawn, and it was argued that the elements of this division 
had a natural order of priority: one must know the combinatory proper- 
ties of a system in order to calculate its probabilistic properties, i.e., 
its performance in the face of random traffic; and one must know both 
the combinatory and the probabilistic properties of systems in order to 
compare them and to select optimal ones. 

In this paper we shall be concerned exclusively with those combinatory 
and topological properties of a general connecting network that seem 
to be most relevant to its performance. 


II. SUMMARY 


Some of the basic combinatory properties of connecting networks are 
studied in the present work. Three of these properties, rearrangeability, 
nonblocking in the wide sense, and nonblocking in the strict sense, can 
be defined informally as follows: for brevity, define an idle pair to be a 
pair of idle terminals consisting of an inlet and an outlet. A network is 
rearrangeable if, given any set of calls in progress, and any idle pair, 
the existing calls can be assigned new routes (if necessary) so as to make 
it possible to connect the idle pair. A state of a network is a blocking 
state if some idle pair cannot be connected. A network is nonblocking in 
the wide sense if by suitably choosing routes for new calls it is possible to 
avoid all the blocking states and still satisfy all demands for connec- 
tion as they arise, without having to rearrange existing calls. Finally, 
a network is nonblocking in the strict sense if it has no blocking states. 

A distance between states of a connecting network can be defined as 
the number of pairs of terminals that are connected in one state and 
not in the other. This distance defines a topology on the set of states. 
Also, the states can be partially ordered by inclusion in a natural way. 
This partial ordering and its dual define two more topologies for the 
set of states. The three topologies so obtained are used to characterize 
(i.e., give necessary and sufficient conditions for the truth of) the three 
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properties of rearrangeability, nonblocking in the wide sense, and 
nonblocking in the strict sense. Each of these three properties represents 
a degree of abundance of states in which calls are not blocked; the mathe- 
matical concept used to express these degrees is the topological notion 
of denseness. A study of some particular connecting networks that are 
rearrangeable is given in another paper.” 


Ill. THE STRUCTURE AND CONDITION OF A CONNECTING NETWORK 


In discussing connecting networks, we shall abstract from the many 
possible technological realizations and actual designs of connecting 
networks, and shall consider only certain relevant features on which we 
can base a useful and sufficiently general mathematical theory. 

Most real telephone switching networks consist of pairs of wires for 
talking paths and electromechanical switches for crosspoints; in certain 
experimental systems the talking paths are pulse code modulation 
channels, and the crosspoints are time-division gates made of transistors. 
However, any attempt to formulate some general properties of connect- 
ing networks must be independent of the network configuration chosen, 
and of the technology used to build the network, for a particular real 
system. A theory must apply equally well to Strowger switches, crossbar 
switches, gas-diode switches, and time-division switches. Unless it is 
independent of technology, a theory of connecting networks is limited 
in scope and may have missed the heart of the problem. We therefore 
use some of the terminology of switching engineers but understand it to 
refer to defined mathematical idealizations of switches, gates, crosspoints, 
transmission links, etc., rather than to the physical entities themselves. 

We distinguish between switching networks used for communication 
and those used for control functions and logical transformations, like 
relay nets. Our concern is with networks of the former kind, and we call 
these connecting networks. 

A communications switching network, or connecting network, consists 
of three kinds of entities: (2) wires or other transmission media along 
which communication may take place; (22) terminals to which the wires 
are attached; and (727) crosspoints or switches which can be used to 
connect the terminals, and hence the wires, together in various combina- 
tions. Each crosspoint can connect together exactly one pair of terminals, 
and it has two conditions: in the ‘‘on” or closed condition the two 
terminals are connected and communication can pass from one to the 
other; in the “off” or open condition the terminals are disconnected, and 
no information can pass through. 

From the point of view of switching, two terminals connected together 
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by a wire are essentially one terminal, albeit a spatially extended one. 
We therefore regard terminals as identical if they are wired together; 
in mathematical terms, we identify terminals under the equivalence 
relation of “being wired together.’’ Henceforth, then, our considerations 
will leave wires out of account, and will be based only on the notions 
of terminal and crosspoint. 

By the configuration or structure of a connecting network, we mean a 
specification of the terminals between which individual crosspoints 
have been placed. By the condition of a connecting network, we mean a 
specification of the closed and open crosspoints. In most cases of interest 
the structure is invariant in time, while the condition changes in a 
random way. We shall assume that at most one crosspoint is placed 
between distinct terminals, and that no crosspoint is placed from a 
terminal to itself. 


IV. GRAPHICAL DEPICTION OF NETWORK STRUCTURE AND CONDITION 


A simple device can illustrate the four notions we have introduced so 
far. In Fig. 1(a) the nodes (points) represent terminals, and the branches 
(lines) labeled x;, 7 = 1, --- , 6, represent crosspoints placed between 
the terminals. The resulting graph represents the structure of a network. 
If we interpret the labels x; as binary variables specifying the condition 
of the (respective) crosspoints, with 0 meaning ‘‘open” and 1 meaning 
“closed,” then an assignment of values 0 or 1 to {a1 , --- , ve} represents 
a possible condition of the network; illustrated in Fig. 1(b). We are 
purposely avoiding the term “‘network state’’ here in order to assign it 
a useful precise meaning in the next section. 

We have illustrated the use of a labeled graph as a general representa- 
tion for (simultaneously) the structure and condition of a connecting 


x _ 
<5 4 7 Xo x,=1 0 fe) 0 
Lo=0 
<3 ="! 1 
Xe 


(a) (b) 


Fig. 1 — (a) Representation of structure; (b) simultaneous representation of 
structure and condition. 
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network. This representation is useful because it identifies the structure 
and condition of the network with a definite mathematical entity. It will 
become apparent that simple properties of this mathematical representa- 
tion have great theoretical and practical relevance to congestion prob- 
lems. 

In general, the labeled fap g Yepresenting* the structure and con- 
dition of an arbitrary connecting network is constructed as follows: 

z. nodes (points) of g correspond to terminals of the network; 
at. branches (lines, or edges) of g correspond to crosspoints of the 
network; 

11. open crosspoints are labeled 0; 

av. closed crosspoints are labeled 1. 
Two terminals are connected in g if g contains a chain of closed cross- 
points from one terminal to the other. 


V. NETWORK STATES 


Let G be a graph representing the structure of a switching network, 
and let V be the set of all labeled graphs g (labeled ‘‘versions” of @) 
obtained by assigning 0 or 1 to each line of G. There are several reasons 
why not every element g of V represents a physically meaningful state 
of the network. 

In most switching systems there is an explicit functional distinction 
between terminals which are used only to connect other terminals 
together, and those between which desired connections arise, and which 
are never used to connect other terminals together. Terminals of the 
former kind we shall call links, because of their intermediary nature, 
and those of the latter kind, inlets and/or outlets. Desired connections 
always arise between two or more inlets or outlets. If more than two 
are involved, the connection is termed a ‘“‘conference”’ call. Usually, 
though, the connections are disjoint chains of closed crosspoints, assur- 
ing private conversations between inlets and outlets by pairs only; 
we restrict attention to these. In terms of our graphical representation 
of the structure and condition of a switching network, the distinctions 
made above impose restrictions on the elements of V which represent 
realistic conditions of a network having the structure of G. 

The restrictions on the assignment of the labels 0 or 1 listed above 
are (perhaps the most important) among many which are imposed by 
the functional and operational features of a real switching system. In 
general, a real connecting network specifies (or uses) only a subset of 


* A glossary of mathematical notations appears at the end of this paper, Sec- 
tion IX. 
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the set V of all possible labeled versions of the graph G that represents 
the structure of the network being studied. We have therefore avoided 
calling elements of V “states of the network’ because not all members 
of V can reasonably represent the condition of an actual network. 
We now attempt to characterize those subsets S of V which can repre- 
sent real networks. Each such subset S will be called a class of network 
states. 

The Boolean operations of join U and meet N (union and intersection, 
respectively) are definable for elements x, y of V in an obvious way: 


a Uy = the V-element having a 1 wherever either x or y has a 1, 
and 0 elsewhere, 

«fly = the V-element having a 1 wherever both x and y have a 1, 
and 0 elsewhere. 


The complement x’ and the difference 2 — y can be defined analogously. 
In view of this it is natural to inquire whether these Boolean operations 
can be used to characterize subsets S of V which are classes of network 
states. 

If the elements x, y of V belong to such a subset (class of network 
states) S, it is not necessarily true that x U y, nor that x M y, belongs 
to S. In the case of « U y, there may be links and crosspoints used in 
both x and y, and so x U y may violate the requirement of privacy. 
Even if x M y = O there may still be inlets used in both x and y, so 
that x U y would lead to undesirable paths of extreme length. In the 
case of « M y, there may be so little in common to x and y that +f y 
reduces to a single closed crosspoint between two links (i.e., not between 
an inlet and an outlet). Thus the Boolean operations do not yield a 
useful way of describing S. 

The preceding remarks suggest that since any connection is a chain, 
none of whose terminals and crosspoints occurs in another connection, 
the labels 0 and 1 are really superfluous, although they served a tutorial 
purpose heretofore. That is, in describing the possible subsets S of net- 
work states, we can (and should) take advantage of inherent physical 
restrictions, and conveniently replace our representation* x e V of the 
structure and condition of a network by a corresponding set of disjoint 
chains, since each physically meaningful element x from V is equivalent: 
to such a set. A formal development of this suggestion follows. 

Let T be the set of terminals of a connecting network. The graph G 
representing the structure of the network is a subset G of the product 


TXT = {(u,v)|ueT,ve T} 


* “y ¢@ V” means that z is an element of the set V. 
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with the properties 
(u,v) eG ifandonlyif (v, u) eG 
(u,u) isneverinG 
and the interpretation 
(u,v) eG if andonly if (u,v) is an edge of graph G 
if and only if nodes uw and v are adjacent in the graph G 
if and only if there is a crosspoint between terminals u 
and »v. 


A chain p of length n between terminals u and v is a sequence of 
elements {z, ¢ T,0 S$ 7 S n} such that 


4:2; for +7, 
(2;,2:41)€G for 7=0,---,n—1. 


Two chains p; and 7, are called disjoint if they have no nodes (terminals 
e T) in common; in this case we write symbolically pi M po = ¢, with 
@ = null set. 

We shall henceforth assume that the set 7 of terminals has been 
(functionally) decomposed into three sets: 


= TUQULZ, 


where J is a set of inlets, 2 a set of outlets, and ZL is the set of links. 
It is possible that J = Q or that JM Q = empty set, or that some inter- 
mediate condition obtain. However, we shall insist that (J U 2) N L 
be null, i.e., that no link be an inlet or an outlet. 

The set C of connections consists of all chains p = {z;¢ 7,7 =0,---. 
n(p)} such that 


zoel, Zn(p) € Q, 20 ~ 2n(p) 
z,e€L, for *#0O or n(p). 


Each element p of C represents a possible connection from an inlet to 
an outlet through the network whose structure is represented by the 
graph G. 

Elements of the set S of network states will be defined as subsets 
x of C, x C C, consisting entirely of disjoint chains, that is, such that 


M1, p.¢€x implies pf po = ¢. 
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Two subsats x and y of C are called compatible if 
piéex, prey implies pif po = ¢. 


The connections that comprise compatible states can all be put up 
simultaneously without interfering with each other or violating the 
requirement of privacy. 

The functional and physical restrictions imposed by real networks 
determine (in any particular system) a subset E of C consisting of 
(what we shall call) the elementary states, or single connections that 
can actually be used. or example, chains in C that double back and 
are wastefully circuitous may be excluded from F. 

Given such a subset / of elementary states, we can define a class of 
network states S, associated with ZH, in a natural way as follows: S 
is the smallest class of subsets of # containing all unit subsets of Z, 
and closed under formation of arbitrary intersections (meets) and unions 
(joins) of compatible subsets of EL. That is, S is the smallest class of E- 
subsets such that 


peli implies {p} €S, 
x,yeS implies «fy eS, 
if a,yeS and pier, prey implies pi. Nm = ¢, 
then aUyeS. 


We henceforth use “iS” as a generic notation for a class of network 
states defined as above. The word ‘‘network”’ will refer to a graph G 
representing structure, choices J and Q of inlets and outlets respectively, 
and a choice / of elementary states. The choice of G, J, 2, and # uniquely 
determines a class S of network states according to the definition given 
previously. The quadruple (G, 7,2, #) will be called a network, N. 

It is easily verified that the class S of network states is partially 
ordered by inclusion, <. Moreover,. any two elements x, y of S have a 
unique intersection (meet) consisting of just those connections common 
to both x and y, and S itself has a unique least element included in every 
other element, viz., the ground state in which no calls are in progress. 
However, since only infima exist, and since there may be many maximal 
elements in the partial ordering, S is not a lattice, in general. 


VI. THE STATE DIAGRAM 


The partial ordering < of S has a special nature that allows us to 
arrange the network states « e S in a particularly intuitive and useful 
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pattern. The following conventions and definitions will be helpful in 
discussing this pattern. 

If K is any set, we use the notation | K | to mean the number of 
elements of K. FE.g., if x is a network state, 


| «| = the number of calls in progress in state x. 
The sets LZ; are defined by the conditions 
L, = {xeS||2| = k}, k=0O,1,-°:-, 


that is, Z, is the set of all network states consisting of exactly k con- 
nections. Lo is a unit set containing just the zero state. The sets Ly 
are a partition of S corresponding to the equivalence relation of “having 
the same number of calls in progress.” 

To obtain our pattern for arranging network states we start with the 
zero or ground state in which no calls are in progress: this is the empty 
set (of chains). Above this zero element, in a horizontal row, we place 
all the states consisting of a single connection, i.c., all the elements of 
£. Continuing in this way, we put the set Zy4;, of states consisting of 
(k + 1) disjoint chains (i.e., k + 1 calls) in a horizontal row above the 
set L, of states with / disjoint chains (i.e., k calls in progress). We call 
LL, the kth level. 

The diagram is completed by constructing the corresponding Hasse 
figure (Birkhoff,’ p. 12); that is, we think of the states « ¢ S (in their 
arrangement into levels L;) as nodes, and we construct a graph by 
drawing lines between states x, y of respective adjacent levels LZ; , Ln+1 
just in case 


y—axels, 


ie., if and only if y results from # by putting up one more call. The 
resulting graph can be termed the state diagram D of the network N 
described by the quadruple (G, J, 2, F). The state diagram D is a 
natural and standard representation of the partial ordering of S. The 
history of the connecting network when operating can be thought of as 
a trajectory on D. | 

We shall use the network depicted in Fig. 2 to illustrate the state 
diagram D. For most practical purposes this network is wasteful of 
crosspoints, but it makes a suitably simple example of the partial 
ordering of the states. The network has four inlets and four outlets, 
and no inlet is an outlet. The squares in Fig. 2 represent 2 by 2 switches, 
as indicated. 

The possible states of this network are determined by all the ways 
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2x2 SWITCH ee i ee = 


xX = CROSSPOINT 


Fig. 2 — Illustrative three-stage connecting network. 


in which four or fewer inlets can be connected pairwise to as many 
outlets on the right, no inlet being connected to more than one outlet, 
and vice versa. These possible states have been depicted in a natural 
arrangement in Tig. 3, which shows a reduced state diagram in which 
states which differ only by permutations of inlets, outlets, or switches 
have been identified. There is essentially only one way to put in a single 
call; there are four ways of putting in two calls; and there are two ways 
each of putting in three and four calls. The states have been arranged 
in levels according to the number of calls in progress. In each state 
only links actually in use are shown, and the different notations on the 
links indicate the routing. 


VII. SOME NUMERICAL FUNCTIONS 


The finite set S of network states is partially ordered by inclusion, 
which we shall denote by S. A chain in S is a subset X of S which is 
simply ordered by (the restriction to X of) S; that is, for any two ele- 
ments xv, y « X, we have either « = y or y 2 x. Such a chain is not to 
be confused with the ‘‘chains” on the graph G that are elements of 
states x e S. The dimension or height | «| of a state is the maximum 
“length” d of chains 0 < a1 < -++ < xq = x that have zx for greatest 
element. (This usage is consistent with the previous definition of | - |.) 

Remark 1: The dimension | 2 | of a state x is the number of busy pairs, 
or the number of calls in progress, in the state x. 

A state x is said to cover another state y if and only if x > y, and 
there are no z e S such that « > z > y. The state x is then “immediately 
above” y. It is apparent that z covers y if and only if x > yand|az| = 
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Fig. 3 — (Reduced) state diagram for the network shown in Fig. 2. 


is 


|y | + 1. In fact, the construction of the partial ordering of S arranges 
the states according to levels, each level being the (equivalence) class 
of all states having the same dimension. In determining dimension one 
need only consider chains that are “maximal” or “connected” in the 
sense that 2; covers x,_; for all 2. Also, it can be seen that the partial 
ordering S of S satisfies the Jordan-Dedekind chain condition: all 
connected chains between fixed end points have the same length. 

The present section will be devoted to various relationships between 
numerical functions defined on S, counting or ‘“enumeration”’ problems, 
etc., based largely on the dimension function and the chain condition. 

The Mobius function p(-) of the partially ordered system (S, S) 
is defined recursively by 


uO) = 1, = w(x) = — 2 wy) if «> 0, 
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where 0 denotes the zero or ground state in which no calls are up. 
The Mobius function has the following two important properties: 
t. Let f(-) be any function defined on S, and let 


F(z) = Dfty). 
yz 
Then f(-) and F(-) are related by the Mobius inversion formula 
f(z) = 2X u(y)F(a — y). 
yu 


Here « — y denotes the state obtained from x by removing all the 
calls of state y; this makes sense, since y < x. (See Weisner.) 

a. Let A(x, n) be the number of chains of length n which can be 
interpolated between 0 and x. Then P. Hall’ has shown that 


—a(e) = Aa, 1) — A@, 2) +e. 


By the Jordan-Dedekind chain condition, all the chains from 0 to x 
have the same length, viz., | « |. Hence for x > 0 


u(x) = (—1)'*' Ae, | e]). 
For simplicity of notation set 


A(a, | a |) 


n(x) 
= number of ways of ‘climbing’ from 0 to wx. 
Also, we introduce the following sets: 
A, = {y|y covers x} 
B, = {y| «x covers y} 
Ly = 1a) |e | =a}. 


These have the following respective intuitive meanings: A, is the set 
of states immediately above x, i1.e., obtainable from x by adding one 
more call; B, is the set of states immediately below z, i.e., obtainable 
from x by removing one call; L, is the nth level, the set of all states 
having n calls up. The cardinality of a finite set X is designated by 

Remark 2:|B,| = || for each x « S. Clearly, x covers exactly 
| «| states, each obtainable from x by removing one call. 

Remark 8: For each x ¢ S 


n(x) = p>? n(y)- 
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Indeed, every state y covered by x gives rise to exactly n(y) climbing 
paths from 0 that reach x via y. 

Remark 4: For « ¢ Ln, n(a) has the constant value n! This is ob- 
vious intuitively, since there are n! orders in which the n calls of x € LZ, 
could be put up. More formally, the result is true for « = 0; assume it 
true for y ¢ L,-1 ; then by the previous results, 


n(x) = p> ny) = |B.|-(— 1)! 
= Ml 
Remark &: The Mobius function yu(-) is given by 


w(x) = (-1)!*! (Ja)! 
= (-1)"n! for vel,. 
Theorem 1: 
1 
| Ln] = = > | Ayl, n> 0. 
yeLn_1 
Proof: The segments in the partial ordering passing from elements 
y € L,1 to L, are just those that pass from some x ¢ L, to LZ,-1 and by 
Remark 2, each a e L, has exactly | x | (= n) such segments. Therefore, 
n-|L,| = 27 | Ay| 
yeLn—1 
and the sum on the right is exactly divisible by n. 
Definition: C,, is the total number of chains (of length n) from 0 into 
L,, , 1.e., to some state in Ly. 
Remark 6: 


C(n) = » nw) = > a(y):| Ay |. 


It can be seen that x e L, has n(x) chains climbing to it from 0; for 
x,yeL,,x # y, these chains are distinct since their highest elements 
are unequal. This proves the first identity. Also each chain climbing 
to L, from 0 must pass through some unique y € Lnz_1. Each y € Lyi 
has n(y) chains of length n — 1 reaching it from 0, and each such chain 
can then be completed to reach L, in | A,| ways. It follows also that 
rae oo eee 
feb Se Au. 


n Yebniy 
VIII. ASSIGNMENTS 


By an assignment we shall mean any one-to-one map a(-) of a subset 
of J into 2. An assignment is to be interpreted as a specification of what 
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inlets are to be connected to what outlets, without regard to the possible 
routes that these connections might take through the network. If 
I 1 Q is nonnull, we restrict assignments so as to satisfy a(u) # wu. 

Let x be a network state consisting of chains 1, po,-+: , Dn With 
n = n(x) and each p; a chain between u; ¢ J and v; ¢ Q. We say that 
x realizes the assignment a(-) if and only if 


t. the domain of a(-) is {u;, 0 S$ 7 S n(a)} 
a. the range of a(-) is {vi:,0 S27 S n(a)} 


wa. au) =v;,0 578 na). 


An assignment is realizable if some network state realizes it; a state 
realizes exactly one assignment; the zero state realizes the null assign- 
ment. A maximal assignment is one that has either domain J or range Q. 
The set of all assignments is denoted by A, and that of all maximal 
assignments by A. 

Two terminals, u e¢ J and v « Q, are connected in state x if and only if 
some chain p e x is a chain between wu and 2, i.e., if and only if x realizes 
the (unit) assignment 


[(u, v)}. 


We define the function y(-) from S into (the set of) subsets of J & Q 
by the condition 


v(x) = {(u, v) ef X Q| wand v are connected in 7}. 


Formally, then y(x) is the assignment realized by state x; heuristically, 
we may think of y(x) as the set of calls which are in progress in state x. 
The set of unit assignments, that is, of 


c = {(u,v)} such that (u,v) eZ XQ, 


will be denoted by U, and a unit assignment c « U will be referred to 
informally as a call. 
If a = a(-) € A is an assignment, we use the notation 


y1(a) 


for the inverse image of a(-) under y(-), 1.e., the set of (equivalent) 
states y such that y(y) = a. In a similar vein, if X is a set of states, we 
define 


y(X) = {aeAla=vy(x) forsome we X}, 


that is, y(X) is the set of assignments realized by members of X. 
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IX. THREE TOPOLOGIES 


Two network states x and y are equivalent, written x ~ y, if and 
only if they realize the same assignment, i.e., 


v(x) = y(y). 
Intuitively, equivalent but nonidentical states correspond to different, 


ways of putting up the same set of calls. 
A pseudo-metriec (Kelley,® p. 118) on S can be defined by the formula 


d(x, y) = | y@Avy) |, x yeS, 
where A denotes the symmetric difference of sets, and | - | cardinality, 
as before. In plain words, the distance d(x, y) between x and y is the 
number of pairs (u, v) e J X Q that are either connected in x and not 
connected in y, or connected in y and not, connected in x. Clearly 


d(x, 0) = |x], 0 = zero state, 
and also 
d(x, y) = 0 ifandonlyif «~y. 


Thus d(-,-) only identifies states up to equivalence. The function 
d(- , +) is obviously symmetric, and the triangle inequality is a conse- 
quence of the set inclusion 


(XAY) € (XAZ) U (YAZ). 


The pseudo-metric d(- , -) can be used to define a topology for S ina 
standard way (see Kelley,® p. 118 et seq.) The closure of a set X in the 
d-topology consists of all states equivalent to members of X, and is 
denoted by X?. 

For each subset X of S, we define its <-closuwre X by the condition 


X ={yeS|ySX-2 forsome xe X}. 


The operation on sets so defined satisfies the Kuratowski closure axioms 
(cf. Kelley,® p. 43): 


¢=¢ 

xox 

LX 
xXUY=xXUY 


th 


and so defines a closure topology for S. The set X consists of all states 
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that are “below” some member of X in the state-diagram D, i.e., can 
be reached from a member of X by removing calls. 
In a similar way, we define the 2-closure X of a set X C Sas 


X ={yeS|y2x forsome xe X}. 


The converse of a partial ordering relation is also a partial ordering, 
called its dual. Hence the mapping X — X is also a closure operation, 
defining a third topology on S. 


X. SOME DEFINITIONS AND PROBLEMS 


An inlet or outlet is zdle in a network state «x if it belongs to neither 
the range nor the domain of the assignment y(x) realized by x. An 
idle pair of the state 2 is an element (u, v) of J X Q such that both 
u and v are idle in x. A call c = {(u, v)} is new in a if (u, v) is an idle 
pair. 

We shall now define what is meant by a blocked call. Let a ¢ S realize 
the assignment (x) and let ¢ be a new call in 2, ie., let 


ec = {(u,v)} €U 
be a unit assignment such that (w, v) is an idle pair of «. The new call c 
is blocked in x if there is no state y > x such that 

yy) = v(x) Ue. 

A state 2 is a blocking state if some call is blocked in x. The state 2 is 
called nonblocking if and only if for every idle pair (wu, v) of x, the call 
c= {(u, v)} 
is not blocked in 2, i.e., there is a y e S above x which realizes the larger 

assignment y(x) U ¢, so that 
vy) = v(x) U {(u, »)}, 
Yo >. 

The set of nonblocking states is designated by the symbol B’. A 
state that realizes a maximal assignment has no idle pairs, and is (triv- 
ially) nonblocking. In plain terms, a nonblocking state x is one in which 
any idle inlet u can be connected to any idle outlet v without disturbing 


the calls that are already present; in this case there is a path r, disjoint 
from all paths p e x, between u and v, and 


xz U {fr} €S, 


1e., use of this path results in a network state. 
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A network N = (G, J, Q, F) will be called nonblocking in the strict 
sense if and only if every state is nonblocking, i.e., B’ = S. Such net- 
works have been discovered and studied extensively by C. Clos. (See 
Clos’ and Kharkevich.*) A network that is nonblocking in this strong 
sense has the property that no matter in what state it is, any idle pair 
can be connected (in a way that results in a legitimate network state). 

In most switching networks there may be several or many ways of 
connecting an idle pair, i.e., putting up a new call, in a given state, 
all of which lead to legitimate network states. Thus, even if the set S 
of network states contains blocking states, it is conceivable that by 
making the right choices of paths for connections one might avoid all 
the blocking states, and still satisfy all demands for connection as 
they arise, without disturbing calls already present. That is, there may 
exist a rule for choosing paths which, if followed, confines the trajec- 
tory of the system to nonblocking states (without refusing any demands 
for connection by idle pairs). 

We next discuss what is meant by a rule. If a call c = {(u, v)} Is 
blocked in a state x it cannot be put up without disturbing existing 
calls of x, and there is no question of using a rule. Also, if x is a maximal 
state, no new calls can be put up, and a rule is unnecessary. But if a 
call c can be put up in one or more ways in the state x, then there is at 
least one y > x such that y(y) = y(z) U ¢. In such a case some method 
of specifying permitted or prohibited new states could be used in order 
to improve performance. 

A rule p(- , +) for a network N is a mapping of the Cartesian product 


[S — y(A)] x U 


into subsets of S, with the properties: if « «e S andc = {(u, v)} « U 
with (u, v) an idle pair of x (so that c is a new call in x), then 


0S p(x, c) Cy "Ur(x) Uc); 


if x is maximal, or if (wu, v) is not idle, p(x, c) is defined (arbitrarily) 
as the null set. If for some call c not up in x we have 


ye p(x, ¢), 


we say that the transition (between states) « — y is permitted by p(- , -). 

We say informally that a state x is reachable under a rule p(- , -) if 
there is some sequence of changes of state, consisting of either hangups 
or transitions permitted by p(- , -), and leading from the zero state to 
x. More precisely, we define the notion 
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x is reachable under p(- , -) in steps 


recursively, as follows: 

1. The zero state is reachable under p(- , -) in zero steps. 

a. If 2 is reachable under p(- , -) in n steps, and for some call c e U, 
v(x) = y(y) Uc, then y is reachable under p(- , -) in (n + 1) steps. 

at. If 2 is reachable under p(- , -) in n steps, and for some eall c ¢ U, 
c is new in a and y € p(2, c), then y is reachable under p(- , -) in (n + 1) 
steps. 

A state is reachable under p(- , -) if it is reachable under p(- , -) 
in n steps, for some n = 0. The set of states that are reachable under 
p(- , -) will be denoted by R,. 

A network N = (G, I, Q, £) will be called nonblocking in the wide 
sense if and only if there is a rule p(- , -) for N under which no block- 
ing state is reachable, 1.e., 


Rese B' 


In words, we may say that a network is nonblocking in the wide sense 
if there is a rule, depending on the states, and on the connections that 
are requested, such that if the rule if used (starting from the zero state) 
no blocking state is ever reached, and hence no request for connection 
by an idle pair (of a state that can be reached) need ever be refused. 
In making this definition, we think of the system as starting (empty) 
at the zero state; in any state x that it reaches, any idle pair of x may 
demand connection; it must always be possible to make this connection 
without disturbing existing calls, and reach a (nonblocking) state y 
one level higher, y ¢ Zy2)4 1; at any instant an existing call may termi- 
nate, and the system move to a state of Z)2;_1. An example of such a 
network was given in Ref. 1. 

Finally, we consider a still weaker property of networks than the 
first two defined, namely, the possibility of satisfying a demand for 
connection by rearranging (if necessary) the existing calls in such a 
way that the desired call can then be accommodated. Let x be a net- 
work state realizing the assignment y(x). We call x rearrangeable if and 
only if for every idle pair (wu, v) of x there is a y e S, possibly depending 
on (u, v) and 2, which realizes the larger assignment y(x) U {(u, v)}, 
L.e., 


yy) = r(x) U {(u, »)}. 


Alternately x is rearrangeable if for every call c new in x there is a y 
such that 
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vy) = v(x) Ue. 


This definition is the same as that of a nonblocking state except that 
the condition x < y is omitted. That is, to realize the larger assignment 
v(x) U ¢ it may be necessary to reroute existing calls to give a new 
state zg ~ x which is not comparable to x, and which has a path 7, 
disjoint from p ¢ z, between u and v. The state y may then be taken 
to be z U {r}. A network N is called rearrangeable if its states x ¢ S are 
rearrangeable. 

With these definitions laid down, we can formulate several problems 
of the combinatory theory of connecting networks: 

t. Can general characterizations of the properties of being rearrange- 
able, and of being nonblocking (strict or wide sense) be given? | 

a. What relationships exist among the concepts we have defined? 

at. What specific networks are rearrangeable, or non-blocking (strict 
or wide sense)? 

To attack problem (z) we make the following observations: the three 
properties of interest represent different degrees of abundance of states 
in which calls are not blocked. The relative abundance or density of 
such states throughout S determines which (if any) of the three prop- 
erties N has. The heuristic concept of abundance suggests the topological 
one of denseness, and the possibility of characterizing the three proper- 
ties in terms of denseness. This idea is developed in the remaining sec- 
tions; it leads to answers to problems (7) and (27) above. 


XI. REARRANGEABLE NETWORKS 


Let X be a subclass of the class S of network states. We say that X 
is sufficient if y(X) = A, 1e., if every assignment is realized by some 
state of X. We make two comments: 

Remark 7: If A C y(x), then X is sufficient. This can be seen as 
follows: every assignment is a subset of some maximal assignment, and 
so belongs to the S-closure X of X. For the same reason we have 

Remark 8: The following properties of a network N are equivalent: 

1. N is rearrangeable. 

17. Some sufficient class exists. 

wii. The range of y(-) includes A. 

It is convenient to approach the study of rearrangeable networks by 
taking the point of view of a particular pair of customers, i.e., of a 
particular inlet-outlet pair (u, v) e J X Q. Such a pair corresponds to a 
unit assignment or call 


c= {(u, v)} € U, 
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any realization of which is among the states of ’, the set of elementary 
states. For each call c « U we define 


I, = {ve S| cis new in a, i.e., (uw, v) is idle in z}, 
B, = {x € S| c is blocked in 2}. 
It can be verified that 
oe ey es for ce U, 
Br = N (B,y', 


ceU 


S—y (A) =UL,. 
ceU 


We call a network N rearrangeable for the unit assignment or call c 
if and only if for every x « J, there is a y « S — I, which realizes the 
larger assignment y(x) Uc = y(y). In words, this condition states that 
for any state in which the pair (wu, v) is idle there is a (possibly rearranged) 
state in which all the same calls are up, and in addition wu is connected 
to v. It is easy to see that N is rearrangeable if and only if it is rearrange- 
able for all calls c e U. 

Let X, Y be arbitrary subsets of S. In accord with a standard definition 
(Kelley,® p. 49), X is said to be dense in Y in the d-metric if Y is in- 
cluded in the d-closure of X, i.c., 


Vx 


Now in a metric space the closure of a set X is the set of all points that 
are at distance zero from X, when the distance of a point y from a set X 
is defined as 


inf d(a, y) . 
eX 
Hence the closure of X is the set of all y such that for some « ¢« X, 


d(x, y) = 0, or equivalently, « ~ y. That is, the d-closure of X is the 
set of all states that are equivalent to a member of X: 


X¢= {yeS|y~au forsome axe X}. 


These observations lead to the following result: 
Theorem 2: N is rearrangeable af and only if 


(B.)' ts d-dense in I, , for eachce U. 


Proof: Let N be rearrangeable; let c « U; and pick x in J, . Then there 
exists y e S such that 
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ry) = r(x) Ue, 


and so there exists a z e H 1 y~(c) such that z S y anda ~ y — 2. 
Obviously then 


y — ze (B.)’ 
and since x is equivalent to y — z we have 
x € ((B.)’)4. 


Since x was an arbitrary member of J, , we have proved J, € ((B,)')4. 
Conversely, assume that the condition in the theorem holds, and pick 
any ce U, andwveJ,. Then x ~ y for some y in (B,)’, so that ¢ is not 
blocked in y. Thus N is rearrangeable for all c « U, and so is rearrange- 
able. 

A similar argument yields the weaker and simpler result: 

Remark 9: If B’ is d-dense in S, then N is rearrangeable. In this case, 
since S € (B’)4, given a state x there is always an equivalent nonblock- 
ing state y, with 


yr ar, ye B’. 


Hence rearrangements can be made uniformly in the calls new to z. 


XII. NETWORKS NONBLOCKING IN THE WIDE SENSE 


We now turn to the characterization of networks for which there is a 
rule for routing calls which allows the operating system to avoid blocking 
states entirely. The case in which the network is actually nonblocking 
in the strict sense, so that any rule will do, is excluded here as trivial. 
The point is to use a network with blocking states, but to manage to 
avoid them by clever routing. The following general criterion of a useful 
rule p(- , -) suggests itself: p(- , -) should make as many blocking 
states as possible unreachable, consistent with satisfying requests for 
connection by unblocked new calls. 

To exhibit, in an intuitive way, all the relationships that obtain, it is 
convenient to introduce an additional concept: a class X of network 
states is preservable (by new calls) if and only if for any x e X and any 
call c that is new to x and unblocked in z, there is a state y e X such that 


y>x and yy) = y(x7) Ue. 


That is, if an idle pair (u, v) of x corresponds to a call ¢ = {(u, v)} 
that is unblocked in x, then some state y « X realizes y(x) U {(u, v)}, 
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and y is above 2 in the state-diagram, y > a. In words, X is preservable 
if any call that can be put up at all in a state of X can be put up salva 
staying in X, that is, in such a way that the system stays in X. A 2- 
closed class is always preservable (by new calls). We make 

Remark 10: If X is preservable, 0 ¢ X, and X € B’, then X is sufficient. 

It is then possible to start at the zero state, and call by call realize 
any maximal assignment salva staying in X. We now state 

Theorem 8: N is nonblocking in the wide sense if and only if there 
exists a nonempty subset X of states such that 

t. X is preservable. 

a. X SB. 

at. X is S-closed, i.e., X = X. 

Proof: Let (z)- (Gin) held for some subset X, and define a rule p(- , -) 
by the condition that if ¢ e U is new to x, then 


p(x, 6) = y3 (yz) Ua) NX. 


Use of p(- , -) is tantamount to requiring that any call must be put up 
so as to lead to a state of X. By (2) and (72), this can always be done. 
Since X is S-closed, hangups preserve membership in X; since X is 
nonempty it contains the zero state. Hence all states reachable under 
p(: , -) belong to X AN B’ and 


R, CB’, 


so that N is nonblocking in the wide sense. 

Conversely, if N is nonblocking in the wide sense, then some rule 
p(:, -) is such that no blocking state belongs to R,. Set X = R,. 
Then X is <-closed, because any state below a reachable state is reach- 
able by hangups. Also X € B’, because p(- , -) avoids all blocking 
states. Finally, X must be preservable since one can ‘“‘preserve’’ X simply 
by using only state-transitions permitted by p(- , -), 1.e., by putting up 
unblocked new calls so as to lead only to states vouchsafed by p(- , -). 

We recall that for x e S, 


A, = {y| y covers x}, 


{y| y = x Uz for some z e E}, 


{set of states immediately above 2}. 


The property of preservability (of a set X of states) will now be given 
a topological characterization in terms of denseness, in the following 
result: 
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Theorem 4: A nonempty subset X of S ts preservable if and only af for 
every x « X, A, X ts dense in Az in the sense of the d-metric; 7.e., « « X 
implies 


A, €(A,N X)4. 


Proof: Take x « X and y e Az, so that y is “immediately above” z, 
or y covers x. Then there is a call ¢ new in x such that 


vy) = r(x) Ue, 


and so if A, M X is dense in A,, there is a z ¢ A, f X which is equiva- 
lent to y. Since z covers x, it follows that the call c new to x can be 
connected in state x so as to give rise to a state of X. That is, we have 


zexX 
y(z) = v(x) Ue. 


Since c was an arbitrary new call of x « X, the set X is preservable, if 
the condition of Theorem 4 is true. Conversely, let X be preservable, 
and take « e X and y e A,. Then there exists a call ¢ not blocked in 
x with y(y) = y(x) U ¢. But since X is preservable, and ¢ is not blocked 
in x, there is a z in A, MN X such that y(z) = y(x) Ue, that ise ~ y. 
Hence y is equivalent to an element of A, M X. Since y was arbitrary, 
it follows that for x ¢ X, 


Ay Cc (A, Nn a Ge 


Remark 11: The sets {A,, « « X} in the condition of Theorem 4 
may be replaced by the “x-cones” 


fyly > a}. 


XII. NETWORKS NONBLOCKING IN THE STRICT SENSE 


A network that is nonblocking in the strict sense has no blocking 
states whatever. A simple characterization of this property is given by 

Theorem &: N is nonblocking in the strict sense uf and only tf there is a 
subset X of B’ such that 

1. X 1s sufficient 

ti. .X is d-closed, i.e., X = X’. 

Proof: If N has the property, then S = B’, and we may take X = S. 
Conversely, if (z) and (iz) obtain, take any x e S; since X is sufficient, 
there exists y « X for which y(x) = y(y), ie., 2 ~ y. But X = X4, 
so x e X, and hence x e B’. 
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By Remark 10, the condition (7) that X be sufficient can be replaced 
by the condition that X be preservable and nonempty. 


IX. GLOSSARY 


I SRARDNEBVHARTE DA 


—_o 
8 


¥(x) 
d(x, y) 
xX 


xX 

X4 

B 

p(- ’ ") 
Ky 


an arbitrary graph 

a copy of G with each edge labeled 0 or 1 
the set of all labeled versions g of G 

the set of network states (typical members 2, y, 2) 
the set of terminals (nodes of G) 

a typical path (chain) on G 

null set 

the set of inlets 

the set of outlets 

the set of links 

the set of all connections (paths from J to Q) 
the set of elementary states 

an arbitrary network, specified by choosing G, J, 2, and E 
partial ordering of V or S by inclusion 

zero state 

number of elements of the set X 

set of states with exactly & calls up 

state diagram (Hasse figure of S on S) 
Mobius function of S$ 

number of chains of length n from 0 to x 
A(z, | x |) 

set of states directly above x 

set of states directly below x 


> (x) 


zeLn 

an assignment (any 1-1 map of subset of J into Q) 
set of maximal assignments 

set of unit assignments or calls 

a call, or typical member of U 

the assignment realized by state x 
equivalence of states 

| y()Ar(y) | 

S-closure of X 

=-closure of X 

d-closure of X 

set of states in which some call is blocked 
a rule for operating a network 

the set of states reachable under p(- , -). 
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Solution of Systems of Linear Ordinary 
Differential Equations with Periodic 
Coefficients 


By H. E. MEADOWS 
(Manuscript received February 7, 1962) 


An analysis technique is presented to provide an essentially explicit 
solution for a system of n simultaneous first-order linear differential equa- 
tions with periodic coefficients. This representation of a periodic variable- 
parameter linear system of arbitrary finite order is chosen for its theoreti- 
cal and practical advantages over the classical nth order linear differential 
equation. Emphasis is placed on natural mode solutions of a homogeneous 
set of equations. The characteristic exponents for these solutions are deter- 
mined from a polynomial equation the coefficients of which are linear 
combinations of n — 1 convergent infinite-order determinants. Approximate 
calculation of these determinants ts feasible for problems of moderate order. 


I, INTRODUCTION 


Systems of linear ordinary differential equations with periodic coeffi- 
‘clients are assuming an increasing importance in engineering problems. 
Two applications of present interest are periodically time-variable net- 
works and multimode waveguide with periodic physical distortions. 
Such applications have usually been analyzed by methods appropriate 
to special cases such as the second-order case or by approximate tech- 
niques valid for almost constant-parameter systems. However, pertur- 
bation techniques for almost stationary systems are inadequate for 
careful analysis of large-signal behavior of time-variable networks. 
Similarly, a periodically distorted helix waveguide, for which more than 
two modes must be considered,’ should be described by a differential 
system of order greater than two. These examples illustrate the impor- 
tance of a technique for obtaining essentially explicit solutions of 
periodic variable-parameter linear systems. Solutions in terms of char- 
acteristic exponents are known to exist for systems of linear differential 
equations with periodic coefficients.” However, the methods usually 


1275 


1276 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


employed for solving such systems, such as power-series techniques, 
iterative processes, and incremental numerical solution methods, fail 
to provide a system response description valid for all values of the 
independent variable (time, distance, etc.). 

The analysis method to be presented below provides an essentially 
explicit solution for periodic variable-parameter linear systems of arbi- 
trary finite order. The solution describes the system behavior for all 
values of the independent variable. Emphasis will be placed on obtain- 
ing a set of basis functions for a homogeneous system, since the solution 
in the inhomogeneous case can be obtained from the basis functions. 
As shown by Darlington,’ these functions may be regarded as analogues 
of partial fractions in fixed network theory. 


II, FORMULATION OF DIFFERENTIAL SYSTEM 


In this discussion the system of equations to be solved will be repre- 
sented by the vector differential equation 


Ft) = BO)F(t) | (1) 


where F(t) and B(¢) are nth-order column and square matrices, re- 
spectively, and the prime denotes differentiation with respect to the 
independent variable ¢. It is supposed that the elements of B(t) are 
known functions of ¢ with a common period of unity, i.e., 


Bit) = Bit +1). (2) 
The formulation of this problem in (1) is chosen not only for its 
elegance, but also because of its practical advantages. As indicated by 
Kinariwala* these include the ability to write such an equation directly 
from a time-variable network, the fact that the eigenvalues of B(¢) are 
natural frequencies for stationary networks, and the convenience of (1) 
in obtaining the quadratic forms representing stored energy and dissi- 
pated power in stationary or nonstationary cases. These advantages have 
their translated versions in other physical problems, including multi- 
mode waveguide problems. Moreover, an equation such as (1) is 
easily obtained from an nth-order linear differential equation, but the 
transformation from (1) to such an equation can be quite difficult (or 
analytically inconvenient).* Thus, (1) represents a well-founded be- 
ginning for the analysis of variable-parameter problems of practical or 
theoretical interest. 


III. FORM OF SOLUTIONS 


‘The form of solutions of (1) is well known;’ pertinent properties of 
such solutions will be reviewed here briefly. If B(t) is piecewise con- 
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tinuous (1) has the unique solution 
P(t) = X(t)F(0) (3) 
where X(t) is the unique nonsingular square matrix satisfying 


X’ = BX 


X(0) = J = diag {1}. ) 
When B(¢#) satisfies (2), X(t) may be written as 
X(t) = J(t) &** (5) 
where 
J(t) = Jit +1) (6) 
and 
e* = X(1). (7) 


Tor convenience it will be assumed here that the eigenvalues of K 
are distinct, or at least that K can be diagonalized; thus, a constant 
nonsingular matrix P exists so that 

K = PMP" (8) 
where 
M = diag {u,} (9) 


and the constants yu; are the eigenvalues of K. The matrix exponential 
function in (5) may be similarly diagonalized, so that the solution (3) 
may be constructed in the form 


F(t) = J(t)Pidiag {e"*)]PF(0). (10) 
By establishing the special initial conditions 
0 


0 


F;(0) =P (11) 


1 <— 7 row 
0 
the corresponding unique solution . 

F(t) = eS (t)F (0), (12) 


is obtained from (10). Thus, by proper choice of initial conditions a 
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set of n solutions of the form 
F(t) = e'Q(t) (13) 


where yp is a scalar constant and Q(t) is a column matrix with period 
unity, have been shown to exist. 

The n solutions in the form (12) or (13) represent natural modes of 
the periodic system described by (1) and (2). If the n values of yu; are 
distinct the corresponding n solutions are certainly independent and 
form a set of basis solutions of (1). Any other solution of (1) comprises 
a linear combination of solutions like (12) or (18). Moreover, as Dar- 
lington® has pointed out, these natural-mode solutions are essentially 
unique because of their simple form. Hence the natural modes given by 
(12) represent a complete and essentially unique description of the 
natural behavior of the periodic system. The eigenvalues u; , frequently 
referred to as characteristic exponents, play a role analogous to response 
poles or natural frequencies of stationary systems. The strength of each 
natural mode in the homogeneous case is determined by the initial 
conditions and the constant matrix P. Moreover, the natural-mode 
solutions allow a complete solution to be calculated in the inhomogene- 
ous case. Thus, the determination of n corresponding solutions for yu 
and Q(¢) in (18) is central to the problems associated with (1) and (2). 

The object of the present treatment is to indicate a technique for 
determining the characteristic exponents yu, as well as the corresponding 
matrices Q(t) if desired. Primary attention is given in finding the char- 
acteristic exponents u because of their practical importance and because 
the solution for Q(¢) is not greatly difficult in principle if the appropriate 
characteristic exponent is known. Solutions for Q(t) are mentioned in 
Appendices A and B. . 

The method to be discussed resembles the technique used by Hill’'® in 
solving the second-order equation 


z(t) + A(t)x(t) = 0 (14) 


where A(t) is periodic. It will be shown that the characteristic exponents 
may be determined from roots of either a transcendental or polynomial 
equation in which certain infinite-order determinants enter as parame- 
ters. A technique similar to Hill’s was employed by H. von Koch in the 
last century to provide an explicit solution in terms of infinite-order 
determinants for a general nth-order linear differential equation with 
periodic coefficients. This technique is carefully discussed by Forsyth’ 
and Riesz,* who also give references to von Koch’s original papers. Thus, 
the method presented here, although developed independently, does 
not solve an unsolved mathematical problem when applied to a periodic 
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variable-parameter system described by an nth-order linear differential 
equation. It does, however, solve the stated problem in a way that 
appears to have several advantages, mostly associated with its formula- 
tion as a system of n simultaneous first-order linear equations. These 
advantages, already mentioned in Section IJ, seem likely to make the 
present solution technique more useful in the analysis and synthesis 
of periodic variable-parameter systems than one based entirely on the 
classical nth-order linear differential equation. 


IV. INTEGRAL FORM FOR THE PERIODIC SYSTEM 


The analysis of the periodic system begins by multiplying both mem- 
bers of (1) by e”, where a is an arbitrary constant, and adding and 
subtracting aFe “’ to yield, whenever F’ exists, 


(Fe) + aFe“ = BFe™. (15) 
Integration of (15) results in the integral equation 
Fe“ +a] Fo“d = | Bred + C (16) 
where C is a constant. Any solution of (15) is also a solution of (16); 
thus, let / be a solution given by (13) and let 
a= p+ 72rk (17) 


where k is an arbitrary integer. Equation (16) becomes 
QeP™ + (n+ jk) | Qe Rae = f BOT a + C. (18) 


If (18) is evaluated at ¢ = 0 and ¢ = 1, and the results subtracted, the 
first term in (18) makes no contribution, being periodic. Hence, (18) 
implies 


1 1 
(u + j2ak) | Qe?" dt = / BQe?™ dt (19) 
0 0 


for all integers k. It will be seen below that this integral equation suffices 
to determine » and Q(t), which are essentially eigenvalues and eigen- 
functions. 


V. MATRIX DIFFERENCE EQUATION 


To make use of (19) in finding solutions of (1) it will be assumed that 
the given matrix B(t) and the solution matrix Q(¢) may be expanded in 
the lourier series 
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B(t) = s Bee (20) 
and 
Q(t) = Do Qe" (21) 


where matrices B, are square matrices and Q, are column matrices. 
Requirements on the asymptotic behavior of the elements of matrices 
B, and Q, for large values of | p | will be discussed in Appendix A in 
relation to convergence of certain infinite-order determinants. The 
Fourier series for the matrix product BQ may be written as 


BQ = Y (BQ)ye™ (22) 


p= 


in which the column matrices (BQ), are given by the convolution 


(BQ) >» a ps B _Q),. (23) 


Iixcept for a factor of 27 the integrals in (19) express the Jourier 
coefficients of Q and BQ. Thus, if Q and BQ possess Fourier series (19) 
is equivalent to the infinite set of linear equations 


(u + j2mk)Qr = (BQ): (24) 
or 
(u + i2rk)Q: = Yo BesOs, (25) 


where k assumes all integral values. Equation (25) might be regarded 
as a matrix difference equation for Q, ; however it is more convenient 
here to consider (25) as defining an eigenvalue problem for an infinite 
matrix. In terms of Kronecker’s 6, (25) is 


0= pa [By_+ —_ Sir (u =e j2ak) 1\Q, (26) 


where J is the nth-order unit matrix. The expanded form of (26) is 
shown in the following infinite-order matrix equation, in which the 
first matrix is partitioned into n X n size blocks and the second into 
n X 1 size blocks. The “origins” of the matrices fall at (By — uJ) and 
Qo. 


SYSTEMS OF LINEAR DIFFERENTIAL EQUATIONS 1281 


*» Bo— (u— jar) Bu B_s Qs 0 
By Bo—(u—j2m)I Bu Bis Q-1 0 

By Bi By — pl Bu Bs Qo | =|0 

Bs By Bor (ut+92x)I Bu Qt 0 

By Bi Bo — (ut j4m)I -++|| Qe 0 











(27) 
For convenience it will be assumed that Bo is a triangular matrix so 
that its eigenvalues appear explicitly as main diagonal elements. To 


show that a constant linear transformation of the dependent variable 
F can always produce this property, let 


Bit) = By) + A(t) (28) 

where A(t) has a zero mean, and let 
X = PF (29) 
where P is a nonsingular matrix of constants. Then (1) is transformed to 
| X’ = (PBP™ + PAP )X. (30) 


This equation has the same form as (1), but the constant term in its 
coefficient matrix is the matrix PB)P' derived from By by a similarity 
transformation. It is well known that a square matrix is reducible by a 
similarity transformation to the classical canonical form having eigen- 
values on the main diagonal and possibly nonzero constants in some 
positions of the next higher diagonal.’ (These constants cannot appear 
if By has distinct eigenvalues; hence, Bo can often be assumed to be 
diagonal.) The matrix Bo can also be reduced to a triangular form by a 
similarity transformation in which P is a unitary matrix.’’. This re- 
duction, which is always possible, may sometimes have advantages in 
studying energy functions or related quadratic forms. Thus, by either 
technique By can be reduced to triangular form. It will be assumed that 
such a transformation has been effected in obtaining (1). 


VI. CONVERGENT LINEAR EQUATIONS AND INFINITE-ORDER DETERMINANT 


To produce convergence of the determinant of coefficients of the 
infinite set of homogeneous equations defining Q, Equations (26) or 
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(27), divide each elementary row of the coefficient matrix in Equation 
(27) by its main-diagonal elements. When Bp is diagonal this process is 
identical with multiplication of each equation in Equation (26) or 
each matrix row in the square matrix of Equation (27) by the diagonal 
matrix [By — (u + j2rk)I]|’. In general, let the matrix A be defined by 


A = diag {),} (31) 


where i, represent the n eigenvalues of By. Then the set of equations 
with convergent determinant may be written as 


[Mi|[Q-] = 0 (32) 
where submatrices M;, are given by 
Mix = [A — (# + j2ak)IT"[Bo — (u + j2ak) I] (33) 
and 
Mi, = [A — (u + j2ak) I) 'Bi_- , k#r. (34) 


When By is diagonal, My reduces to the unit matrix. Thus, a typical 
determinant d[M/;,] of (32) may be illustrated for n = 2 by the following 
scheme: 




















1 a_i b_1 a_2 b_» 
M+j2r—p M+tJ2r—p M+7J2r—p M+j2r—p 
0 C1 d_x C2 d_»s 
Not y2ar—p rot jJ2r—p Art J2r—p rot Jj2a—p 
ay by 1 0 a1 ba 
M — B AL — Yt A — Bw Al — pb 
d(u) = 
Cy dy 0 1 C_1 d_1 
Ae B Ae — Bw Ag — yb A2 — 
a2 by ay by 0 
M—J2r—p M— jr — pw M—j2r—p M—j27—p 
Ce dy C1 at 0 1 
Ne— J2r— pb Ne—j2r—p ho—j2r—p Ne— J2r—p 
(35) 
The notation used in (35) is 
Ap by 
By , p#od0 (36) 
bp dp 


F 
By = (37) 
QO re 
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The determinant of the infinite-order matrix [M;,] of (32), illustrated 
by (85) for n = 2, will be denoted by d(x) to show its functional de- 
pendence on the argument yu. The function d(z) is actually a determi- 
nant of infinite order. If this determinant converges it represents a 
function of » which must vanish in order to obtain nontrivial solutions 
for Q, in (32). Requirements necessary for the convergence of d() are 
discussed in Appendix A, where it will be shown that d(u) converges 
for a large class of problems. Hence the basic equation 


d(n) = 0 (38) 


defines the characteristic exponents of the differential system (1). 


VII. FUNCTIONAL EXPRESSIONS FOR THE CHARACTERISTIC DETERMINANT 


Equation (38) taken alone is rather unwieldy, involving as it does 
the equation to zero of an infinite-order determinant whose elements 
are functions of uw. However, it will now be shown that expressions for 
d(u) in terms of elementary functions may be written to allow a simple 
solution of (38). 

The determinant d(u) is shown in Appendix A to converge for all 
values of » except those for which the denominators of rows of d(z) 
vanish. Multiplication of one row of an infinite-order determinant by any 
scalar is equivalent to multiplication of the determinant by the same 
scalar. Similarly multiplication of any row of d(u) by its corresponding 
denominator \, — (u + j2ak) produces a determinant convergent at 
Ap = H+ 727k, so that each row of d(u) introduces exactly one pole 
in d(u). Moreover, d(u) is periodic in » with period 727, since replacing 
uw by w + g2m only shifts the origin of the infinite-order determinant. 
Evidently d(u) has simple poles at 

= Ap + 7279, p =1,2,---,n q integral. (39) 
It will be assumed for the moment that these poles are distinct; this 
restriction may be relaxed slightly, as shown in Appendix B. Finally, 
as pw approaches infinity along any radial line in the complex yu plane 


except a vertical line, the off-diagonal elements in d(yu) tend toward 
zero, or briefly 


d(o) = 1. (40) 
The periodicity of d(u) implies that the residue of d(u) at any of the 


poles in (39) is independent of the particular integer g. Thus, a formal 
expansion of d(u) in partial fractions is 


n 0 K 
d(u) =Ke+ 2) 2, —*2-. (41) 
p=l g=—~ W — Ap — J27q 
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According to the Mittag-Leffler theorem” this expansion defines the 
function 


d(u) = Ko + De K, coth (“= ») (42) 


A relation fixing K, may be derived from (40) by noting that as pz 
approaches infinity along any nonvertical radial line 





fiaeork (" = ) 4 (43) 
pow 2 
so that 
sino ke (44) 
2 p=l 


To compute the residues K, the well-known rule 


Ky = Le (u — Ap)d(u) = [CH — Ap) dH) uma, (45) 
Np 
is employed. The procedure is simply to multiply every element in the 
row of d(u) containing 4, — yp (in the denominators) by the factor 
(u — \,) and to evaluate the resulting determinant. For example, in 
the case of n = 2 used above for illustration, the row of d(u) containing 
\1 — w in the denominators is replaced by 


* —As — by —ay —b,0 0 —Aa_1 —b_y —A_2 —b_» ae (46) 


and the resulting determinant evaluated at u = 1. Reasonably accurate 
and efficient methods for computing K, from such a determinant can 
be programmed just as for Hill’s determinant in the second-order case. 
Such a technique is discussed briefly in Appendix C. 

It is well known that the solution of Hill’s equation generally requires 
the evaluation of only one infinite-order determinant, while the solution 
of a second-order problem using (38) and (42) appears to require the 
evaluation of two determinants. Actually it will be shown that only 
n — 1 determinants need be calculated for an nth-order system of equa- 
tions. In addition (42) may be simplified because of the relation among 
the residues K, to be demonstrated below. 

To examine the poles and zeros of d(y) it is convenient to consider 
the complex » plane divided into horizontal strips of width 27. The 
poles of d(u) fall at \» + j2rq. Although the eigenvalues \, may lie 
in any of these strips, values of g always exist to give one pole in the 
fundamental strip 0 = Im uw < 27 representative of each \,. Hence 
d(») has exactly n poles in each strip. It will be seen shortly that d(u) 
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also has n zeros in each strip so that a pole-zero constellation for d(.) 
might be illustrated by Fig. 1. 
The desired relation among the residues K, is obtained by noting that 


na 


[awd = Ky (47) 


where the integral is taken around the rectangular contour abcd shown 
in Fig. 1 (or a congruent rectangle vertically displaced if a pole happens 
to fall at Im up = 0). The periodicity of d(u) insures that the contri- 
butions to the integral from the horizontal sides ab and cd will cancel. 
The vertical sides be and da are supposed to be displaced from the 
origin far enough to include all n poles in the rectangle so that (47) is 
valid. As their displacement approaches infinity the value of d(u) ap- 
proaches unity and the contributions from the vertical sides tend to 
cancel. Thus (47) implies 


> Ky = 0. (48) 
p=1 
This relation shows that (44) and (42) may be simplified to 
Kae 1 (49) 
and 
ly be — Ap 
d(u) =1+=)>> K,coth{——). (50) 
2 p=l 2 
It also allows one residue to be computed from the other n — 1, al- 
ae (tL PLANE 





X POLES 
© ZEROS 


Fig. 1 — Pole-zero constellation. 


1286 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


though all n residues might be computed in practice and (48) used as a 
check for numerical accuracy. 

Equation (50) expresses the characteristic determinant d(,) 
terms of the eigenvalues \, of the stationary part of the system and the 
residue determinants K,. The characteristic exponents » are thus by 
(38) and (50) the roots of the trigonometric equation 


sp ee aot (“= ), (51) 
2 p=l1 2 


This trigonometric equation represents an explicit solution of the prob- 
lem of finding characteristic exponents for an nth-order periodic system. 

It is evident from (50) as well as from lig. 1 and the periodicity of the 
function e* that the substitution 


ee (52) 


reduces (50) to a rational function in z. Zeros and infinities of z do not 
introduce superfluous poles or other singularities in this function be- 
cause of (43). Thus, the poles and zeros of this rational function are 
mapped by (52) into the poles and zeros of d(w) shown in Fig. 1. Any 
strip of vertical width 27 in the uv plane is mapped by (52) into the entire 
z plane so that the rational function of z has n poles in the z plane. The 
number of zeros of the rational function is also necessarily n. Hence, 
d(u) has precisely n zeros in any horizontal strip of width 27 in the u 
plane. 

Because of the existence of well-developed techniques for polynomial 
manipulation, such as approximate solution methods, interpolation 
formulae, and stability criteria, it is practically convenient to utilize 
(50) and (51) in rational form. Accordingly let z be defined by (52) and 
Lp by 


tp = e?, p=1,2,-:-,n, (58) 
so that d(u) is transformed to 
D(z) =1+5 52, Ky x2 + 22) = d(log z). (54) 
Further, let 
g(e) = I (@ - a) (55) 


be a characteristic polynomial defining the eigenvalues of the stationary 
part of B(t). (This “characteristic polynomial’ differs from the con- 
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ventional one in that its roots are e? rather than A, .) Equation (51), 
the characteristic equation, then becomes 


0 = f(z) + g(2) (56) 
where 
fe) =} Ke +2,) IT @ - 2). (51) 
ap 


These equations demonstrate that the characteristic polynomial for 
the periodic system is obtained by adding a certain interpolating 
polynomial to the characteristic polynomial of the stationary part of 
the system. The behavior of the interpolating polynomial is prescribed at 
the roots of the stationary part. 

The interpolating polynomial f(z) has the n assigned values 


f(%p) = Kyty I] (tp — Xq); (58) 
ip 


because of the relation (48) among the residues K, the polynomial 
(57) is identical with the Lagrangian interpolating polynomial 


fle) = 2 Keay TI (@ =a). (59) 
aA p 


Evidently the interpolating polynomial f(z) is the unique polynomial 
of degree n — 1 having the assigned values (58). Thus f(z) + g(z), the 
characteristic polynomial of the periodic system, is the unique monic 
polynomial of degree n having the n assigned values given in (58). 
This point of view may give some insight into stability questions. 
For example, the classical criteria of Routh and Hurwitz, and other 
results on bounds of zeros of sums of polynomials may be useful here. 

If all the residues K, vanish, as in the stationary case, the limiting 
values of the characteristic exponents obtained from (51) and (56) 
are u = Ay», p = 1,2, ---, nm. In cases of small variations where all 
| K,| are small the characteristic exponents differ very little from the 
eigenvalues of the stationary part. Asymptotically they may be calcu- 
lated from any of the approximate equations 


wit 5K coth (“=™) (60) 


z~a,(1 — K,) (61) 
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or 
1p hyp — Kp- (62) 


Although perturbation type solutions such as (62) probably are more 
easily calculated by less complicated techniques, characteristic expo- 
nents obtained from (61) or (62) may be useful as starting values for 
solving (51) or (56) by numerical methods. 


Vill. EXAMPLE 


The following example illustrates the technique for finding charac- 
teristic exponents. A second-order case is chosen for convenience 
because some digital computer programs needed for the efficient evalua- 
tion of the residue determinants are not yet available. However, higher- 
order examples are not different in principle nor will they require in- 
ordinately longer computations. 

The Mathieu equation 


he 
a + (3 — 4 cos 2z2)y = 0 (63) 
has the solution” 
y = oe? y Cra ei @rtbe (64) 
with 
B = +&0.57943224.-- . (65) 
In vector form this equation is equivalent to 
0 1 
Y’=fnr Y (66) 
—3+4cos2nrt 0 
with the identifications 
Yy 
Y= | Yi = Y; z= at. (67) 
Y2 


Diagonalization by the transformation PY = F where 
1 jV3 1 
P=>——= = (68) 
J2V3) _ jin/3 1 
yields (1), with 
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ii ae) 2 
res fe 24 € + yrvV3 ee 


gant Soe, 
z= aes 
iV3{4 -1 


The determinant d(u) has the form shown by (35), and the residue at 
pw = jrv/3 is approximately K = 70.562096, a result obtained from a 
42nd-order approximant. From (56) and (57) the characteristic ex- 
ponents are solutions of 


(69) 


cosh p = cos 71/3 — jK sin 1/3, (70) 
which yields, for K & 70.562096 
XY +70.42059r. (71) 


Corresponding correct values of » from (64) and (65) are +470.420577. 
A somewhat longer computation would be required to produce a re- 
sult accurate enough for certain purposes. Such a computation was not 
employed here because a more fundamentally sound computing tech- 
nique for band-limited periodic variations as in (69) would exploit 
the form of the residue determinant and its large number of zero ele- 
ments. Specifically, it is possible to program a determinant evaluation 
technique for such cases so that the computation time is asymptotically 
proportional to the order of the truncated determinant rather than to 
its cube. This possibility is discussed further in Appendix C. 


IX. CONCLUSIONS 


A method has been developed for analysis and calculation of solutions 
of nth-order linear periodic differential systems. The system description 
employed is a set of n simultaneous first-order linear differential equa- 
tions. The method allows the determination of characteristic exponents 
from polynomial equations the coefficients of which are linear combina- 
tions of n—1 convergent infinite-order determinants. Approximate 
computation of the determinants is feasible for problems of finite order. 
In addition to characteristic exponents the complete solutions may also 
be computed if desired. 


APPENDIX A 


Convergence 


The validity of the analysis presented here depends upon the con- 
vergence of the infinite processes employed. It must be shown that the 
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determinant d(u) and the Fourier series for Q(t) are convergent if the 
coefficient matrix B(t) is suitably restricted. For this purpose (32) may 
be written as the infinite set of scalar equations 


x; + po Aye = 0 (72) 
?7=-e 
where a;; and 2; are scalars, and the equations hold for all integral 7. 
The coefficients a;; actually are elements of the submatrices M;,,, and x; 
elements of submatrices Q, in (32). The determinant of coefficients of the 
scalar equations is 


d(u) = | di; + ai; |. (73) 


According to a theorem of St. Bobr™ this determinant is absolutely 
convergent if 
De | ais | (74) 


and 


0 00 p—l 
o, =. | ai ae (75) 


ivi 
converge for some value of p in the interval 1 < p S 2. (For p = 2, 
the case used here, the theorem was given by von Koch.) The expression 
in (74) obviously converges to zero since all a;; in (72) are zero. Let the 
elements of the given matrix B(t) be square integrable functions. Then 
Parseval’s relation applies and the Fourier series coefficients for the 
matrix elements are surely square summable. Hence, the inside sum 
in (75) converges for p = 2. The outside sum also converges for 
p = 2, since its general term is asymptotically proportional to 7° 
for large |7| (as (83) and (34) indicate by their dependence on k). 
Of course, an exception occurs for values of » given by Equation (39). 
The determinant d() is singular at these points, but the convergence 
of the residue determinants K, for simple poles is assured by St. Bobr’s 
theorem. Thus, d(u) converges absolutely and uniformly except for u 
arbitrarily near \, + j27q and has poles at these values of y. 

Since the determinant d(u) has zeros at any of the n characteristic 
values of » within the strip 0 S Imp < 27, the deletion of the zeroth 
equation (7 = 0) from (72) and the transposition of aio% in each equa- 
tion produces a nonzero determinant of coefficients in the equations 


iP Da Ajjl7 = —Ajoto = Yi, i ~ 0. (76) 
j= 
jA0 
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These equations with a;; evaluated at a characteristic value of » have 
as solutions the scalar quantities x; needed to produce the matrices 
Q, and thus the matrix Q(¢). That meaningful solutions to (76) exist 
for an arbitrary constant 2 is shown by a theorem of L. W. Cohen.” 
This theorem (paraphrased) states that if (75) converges for the 
coefficients in (76), if the (convergent) determinant of (76) does not 
vanish, and if 


Ewe 
converges, then the solutions exist, may be obtained by Cramer’s 
rule (with infinite-order minor determinants), and have the property 
that 


ie] 


DS aa)? 


i=—0o 


converges. Thus, if the elements of the given matrix B(t) are square 
integrable functions, the coefficients aj, are surely square summable, 
and the resulting trigonometric series for elements of Q(t) have square 
summable coefficients. The Riesz-Fischer theorem” then states that the 
elements of Q(t) are square integrable functions with Fourier coefficients 
given by the elements of Q, and that the Fourier series for Q(t) con- 
verges to Q(t) in the mean. (Consequently there exists a sequence of 
partial sums of the Fourier series converging to Q(t) ‘almost every- 
where.” ) 

In a more restricted case which might have more practical importance, 
it may be shown that if B(¢) is continuous so that the elements of B, 
are 0(1/p’), the solution matrix Q(t) has the same property. Of course, 
the Fourier series for Q(t) converges absolutely and uniformly in this 
case. This convergence condition and the more general one above demon- 
strate that the analysis technique is valid for a wide class of problems. 


APPENDIX B 


Multiple Poles of the Characteristic Determinant 


If the matrix By, the stationary part of the coefficient matrix B(t), 
has repeated eigenvalues, or if any of its eigenvalues differ by integral 
multiples of 727, some denominators of rows of d(w) are identical. In 
this case d(u) has multiple poles, and the necessary analytical and 
computational procedures become more complicated. It is possible to 
treat the case of a single second-order pole of d(u) by evaluation of n — 1 
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determinants as before, but greater multiplicities require considerably 
more extensive calculations. 

When d(u) has an m-fold pole at » = d, the partial fractions expan- 
sion of d(u) must contain the corresponding principal part of d(u). 
The coefficients in the principal part involve derivatives of (u — 1)” 
d(u) evaluated at » = d,. These derivatives are more difficult to 
compute than the residue determinants of the simple case because they 
are linear combinations of most of the first minors of d(u). The com- 
putation of such minors (not necessarily by direct methods) is also 
required if Q(t) is to be determined (even when d(u) has only simple 
poles). Appendix A shows this computation to be theoretically possible; 
it is equivalent to the inversion of a set of equations like (76). Never- 
theless, the computation effort would be considerably greater than that 
required for computation of characteristic exponents when d(y) has 
only simple poles. 

When d(u) has a single second-order pole, (48) may be utilized to 
make possible the calculation of characteristic exponents. It is conven- 
ient here to use the rational form of (54) for the infinite-order determi- 
nant d(u). Let the repeated roots of Bo be identified with \;, \2. and 
21 , X2 respectively. Define a; and az by 


aq = Ky(a a Xo) 








(77) 
ay = Ke(t2 — 21) 
and allow x; to approach x2 . Substitution of (77) in (54) yields 
De) = 143 (28) a 
1 2 - ; 2 ; " (78) 
Qy On 7 i & Ly 
ewe ay oo ( = a) 
Equation (48) may be written as 
a+ >) K,=0 (79) 
v1 — Xe =3 
so that 
L+ >> lim K, = 0 (80) 
p=3 21-22 
where 





L = lim (2 = “) = lim (K, + K2) (81) 


zyorg \h1 — Le ZZ 
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is a finite limit. Evidently, as x, approaches 2 , 


lim D(z) =1 +h(ite) 4 oi 4} > K, (2 +2). (82) 
= rah 


2129 Xe (2 — 22)? ae 








The zeros of this limiting form of D(z) correspond to the characteristic 
exponents in this case. The parameter a: may be determined by factor- 
ing 1/(A2 — x) from the appropriate row of Kz and computing the 
resulting determinant A, since 


7 V2 — UW 
= A-lim NN 83 
en apes (}: = °) 
The parameter Z required in (82) may be computed from (80). Cases 


where two poles of d(u) are almost coincident may be treated in a 
similar fashion, except that no limits are involved. 





APPENDIX C 


Approximate Computation of Residue Determinants 


In practical cases where the number of terms in the I*ourier series for 
B(t) is limited, truncated approximants to the residue determinants may 
be evaluated by techniques that exploit the special form of these de- 
terminants. The form of a truncated residue determinant is illustrated 
by the scheme in Fig. 2, in which all elements outside of the shaded 
region are zero. Except for one submatrix near the center of the array 
the principal diagonal blocks represent nonsingular triangular sub- 





Fig. 2 — Form of truncated res.due determinant. 
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matrices. To demonstrate the feasibility of computing truncated residue 
determinants of large order it will be shown that the computation time 
required for a reduction to triangular form is much smaller than for a 
general determinant of the same order. (The computation time required 
for a general determinant is asymptotically proportional to the cube of 
its order. ) 

To evaluate the determinant in Fig. 2 let zeros be produced below 
submatrix 1 by elementary operations with the rows passing through 1. 
Similar operations to produce zeros below 2 do not disturb the zeros 
already produced. Such operations may be continued in the usual manner 
to produce zeros below 8, 4, etc., until a triangular array of submatrices 
is realized. The number of arithmetic operations necessary in each step 
of zero production is essentially dependent only upon the order of the 
original system of equations and the number of terms in B(t). Observa- 
tion of Fig. 2 shows that the number of zero-producing steps for a 
truncated determinant of large order is asymptotically proportional to 
the order of the determinant. Thus, the computation time required for a 
reduction to triangular form is also asymptotically proportional to the 
order of the truncated determinant to be evaluated. 
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Prolate Spheroidal Wave Functions, 
Fourier Analysis and Uncertainty —III: 
The Dimension of the Space of 
Essentially Time- and Band-Limited 
Signals 


By H. J. LANDAU and H. O. POLLAK 
(Manuscript received February 23, 1962) 


The purpose of this paper ts to ecamine the mathematical truth in the 
engineering intuction that there are approximately 2WT independent 
signals ¢; of bandwidth W concentrated in an interval of length T. Roughly 
speaking, the result ts true for the best choice of the yg; (prolate spheroidal 
wave functions), but not for sampling functions (of the form sin t/t). 
Some typical conclusions are: Let f(t), of total energy 1, be band-limited 
to bandwidth W, and let 


T/2 
[ | fC) | dt =1— ep. 
_T/2 


Then 


00 [2wWT]4+Nnv 2 


fi) - oS Ang, | dt < Ce> 
0 


inf 


{az} Y-% 





as 

(a) true for all such f with N = 0, C = 12, af the gn are the prolate 
spheroidal wave functions; 

(b) false for some such f for any finite constants N and C if the ¢, are 
sampling functions. 


I. INTRODUCTION AND SUMMARY OF RESULTS 


Intuitive considerations based primarily on the sampling theorem 
have for a long time suggested that the space of signals “essentially” 
limited in time to the interval | ¢| < 7/2 and in frequency to (—W,W) 
cycles is “essentially” 2W7-dimensional. It is the object of the present 
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paper to investigate this problem thoroughly. The first step in the proc- 
ess is to see how the above statement may be made precise. The two 
main difficulties to be overcome in even formulating some mathematical 
problems in this area are contained in the two uses of “essentially” 
above: What shall we mean by “essentially” limited in time and fre- 
quency, and what can we mean by “essentially”? 2W 7-dimensional? 

Suppose that a function f(é) is actually band-limited. It is then an 
analytic function of the complex variable t, and cannot vanish in | ¢| > 
T/2 without vanishing identically. We will therefore think of f(t) as 
approximately time-limited to |t| < 7/2 if a large fraction of its energy 
is contained in that interval, that is, if 


[ lropa 
(0.1) Vleet =l1- en 
[lim ea 


where er will, in much of our thinking, be small; e7 shall be used as a 
measure of the degree to which f(t) fails to be concentrated on the 
interval |t| S 7/2. We will denote by (er) the set of band-limited 
functions f(t) satisfying (0.1) with the further normalization for con- 
venience that 


[1 s@ Pat =1. 


We should point out here that, by previous results,’ 7 and er are re- 
lated: as er becomes small, 7 must grow indefinitely. 

We have now defined our set of functions; how can we speak pre- 
cisely about its dimension? Her) is certainly not finite-dimensional 
for any er > 0, for there is no finite set of functions whose linear com- 
binations exactly express each f(t) in H(er). We will, however, say 
that EH(er) is approximately N-dimensional if there exist N linearly 
independent functions go , --+ gy_1 whose linear combinations approxi- 
mate each f(t) in H(er) to within a small fraction of its energy, that is, 
if 


fi) - pa awi(t) | dt < 6x, 





(0.2) min [ 
{aj} “—@ 


where we shall usually think of 6y as small. Again, 6y may be used as 
a measure of the degree to which E(e,) is N-dimensional. 

In the above definition of the approximate dimension of H(er), we 
have complete freedom in choosing the “basis”. functions go --- gy_1 
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with which we will attempt to approximate f(¢). There are two different 
objectives we may have in choosing the ¢g;. For real understanding of 
the dimension of E(er) we must use the ¢; which best approximate 
E(er), in the sense of making the error, represented by the left side of 
(0.2), as small as it can possibly be over the whole set (er). Alterna- 
tively, for practical purposes, we may wish to use the simplest available 
functions, and see how close we can come with them. Thus there is 
considerable interest in pursuing two lines of investigation: 

(z) Let us first try to identify the best functions ¢; to use, that is 
the functions which achieve 

co N=1 2 

(0.3) min max min [ f(t) — >> aw;(t) | dt. 
eaN 4) segcer) {ay}Q7t —* : 





Once we have found these best functions, what is the relation between 
the number N of such functions, the measure of concentration e7 , and 
the achievable degree of approximation dy ? 

(at) If we pick for the ¢’s sampling functions, i.e., functions of the 
form [sin r(2Wt — r)]/[x(2Wt — r)], what is now the relation between 
N, er, and by? 

It turns out that the answers to (7) and (i) are rather different, 
that is, the degree of approximation achievable by sampling functions 
is in a very real sense poorer than the degree achievable by the besé¢ 
basis functions. And yet the solutions of the two problems are, as we 
shall see, remarkably intertwined. 

In order to give a detailed picture of our results, it is necessary to 
summarize some of the previous work on time- and band-limiting which 
has appeared in Refs. 1 and 2. 

The space £° of square-integrable functions on (— «,) forms a 
Hilbert space in which the inner product (f,g) is defined by 


Gia) = [ sO GW a 


the norm squared of f, || f ||’, is defined by 


Ilr = Gh), 
and is just the total energy. Two functions f and g are orthogonal if 
(fg) = 0. 


To any closed subspace there corresponds a projection operation P, 
which assigns to every function its orthogonal projection onto the 
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subspace. Projections are characterized completely’ by the properties 
P is self-adjoint, and 

(0.4) ; 
P* = P, 


We single out for consideration two projection operators on the 
space of square-integrable functions: time-limiting and band-limiting. 
Time-limiting a function f produces a function Df which is f restricted 


to |¢| s 77/2: 
r= | if |¢| P/2 
0 if |¢>>°7/2 


IIA 


We shall write Df if the specific interval is important to the discussion. 
Band-limiting a function f produces a function Bf whose Fourier trans- 
form agrees with the Fourier transform of f for | #| S 27IV, and van- 
ishes for |w| > 2xrW. If 


Flo) = [ f(s) &* ds, 
1 2rWV . 
Bf = ao F(a) e* do, 
2a on 


or, in terms of f directly, 
Bf = | f(s) sin 2xW(t — s) as 
T J—0 t—s 


The subspace of functions f in £* which are already time-limited, 
i.e. for which Df = f, will be called \, and similarly band-limited func- 
tions, for which Bf = f, the subspace @. The observation made pre- 
viously that a band-limited function which vanishes for |¢| > 7/2 
must vanish identically may now be phrased as 


BND = {0}. 


A major result in Ref. 1 was that there is actually a non-zero minimum 
angle between the spaces @ and O. 

A doubly orthogonal system of band-limited functions y, was in- 
vestigated in Refs. 1 and 2, and a number of properties were derived. 
The following are important to our development: 

Given any T > 0 and any W > 0, we can find a countably infinite 
set of real functions Wo(t), Yi(t), ¥2(¢), --- , and a set of real positive 
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numbers 
No > Ar > de > eee, 


with the following properties: 

(t) The y;(t) are band-limited, orthonormal on the real line, and 
complete in the space of square-integrable band-limited functions of 
bandwidth W cycles. 

| 0,7 4 
(vi, vi) = 1,4 = 0,12; . 
Li=j 


(iz) In the interval —7/2 S$ t < 7/2, the functions Dy;(t) are or- 
thogonal and complete in the space of square-integrable functions van- 
ishing for | ¢| > 7/2. 


Oe 4g 
(Dy; , Dy;) x a,j = 0,1,2, oe 
Nt EI 


(iz) For all values of ¢, real or complex, 


dads = BDYs (= vats) SZC 9) ge) 


We shall write \;(7') if the specific interval is important to the dis- 
cussion. 

We are now in a position to give an account of our results. We repeat 
our basic definition: 
E(er) ts the set of functions f(t) € & such that 


(1) fe @B 
(2) |[ fl] = 1 
(3) || Df|P =1—e. 


Let us turn to the approximate dimension of (er). As we pointed 
out above, the basis {y;}¢ which we wish to use is the one which mini- 
mizes (0.3), that is, which minimizes 


N 2 
‘oes 2s ivi 














_max min 
SEE) (ai) 


It seems reasonable that the best basis, for any given N, should be the 
(N + 1) linearly independent most concentrated band-limited functions, 
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and these are known, from previous work, to be wo, --> , wy. Although 
this seems to be harder to prove than one might expect, it is in fact 
true, and is the subject of 

Theorem 1. For any fixed N, the functions fo, ---, ww achieve the 
2 
min max min 
{eald feEler) {ai} 


N 
‘ies > AiPi 














Thus results on the approximation of (er) by linear combinations of a 
finite number of y,; are in fact best possible results on the approximate 
dimension of (er). 

Theorem 3. Let f(t) € E(er). Then* 


| [2w rT] 2 


2 
< Cier ) 


\ > AnWn 








where the a, are the Fourier coefficients of f in its expansion tn the y’s, 
and C, 1s independent of f, er, and 2WT, and may be taken as 12. 

Theorem 3 shows that [2V7] + 1 of the best basis functions for 
E(er) suffice to approximate a concentrated function to a degree pro- 
portional to the “unconcentrated part” «> of the energy. We shall sce 
that this is no longer the case when we use the simpler sampling func- 
tions. 

In Theorem 3, as we have said, Ci may be taken as 12. What does 
it take to make C very close to 1, that is, to make the approximation 
almost as good as the concentration? First of all, it is important to see 
that roughly 2WT7 functions are not enough to do this, and this is the 
subject of 

Theorem 5. For any er < 0.915, there exists a function f € E(er) 
such that 


[2wr]—2 2 


inf || f — 2 a: 
where C, > 1 and R(WT) -0 as WT — o~. Here Cz may be taken as 
1/0.915 and R(WT) as 2\/%e7""". (If & > 0.915, the right side 
should be replaced by 1.) 

By further analysis, this result may be strengthened so that it in- 
cludes approximations by [2W7] + N of the y,; functions, where N 
is any finite integer. 

Theorem 8. For any given N and «> < 0.916, and for WT sufficiently 


* Iz] means the largest integer < x. 














2 C2(er —s R(WT)), 
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large, there will exist a function f € E(er) such that 


in > a (2 — 24/a eo), 
(If ep > 0.916, the right side should be replaced by 1.) 
Since, by Theorem 1, the y; are the best approximating functions in 
|¢| < T/2, Theorems 7 and 8 hold, a fortiori, for any approximate 
basis {g;}. 

What, then, does it take to bring the constant C of Theorem 3 arbi- 
trarily close to 1? We do not know the best possible result, but there 
is considerable information in the following theorem, due to C. E. 
Shannon: 

Theorem 4 (Shannon) : Given any n > 0, there exist constants C3 = C3(7) 
and C's = C4(n) so that for f € E(er), 


(2WT]4+C3 logt 2WT+C4 


f= 2d as 


(2WT1+N 


= dX aw; 


2 














2 
inf < (1+ n)er.* 


ay 














Thus a number of functions boundedly more than 2WT7 cannot suffice 
for approximating f € EH(er) to within (1 + 7) «>, but a logarith- 
mically growing extra number of terms does. 

Let us now turn to approximating (er) by sampling functions. 
The first result is that [2V7] + 1 sample functions will approximate f 
in energy roughly to within a constant times e7, that is, within a con- 
stant times the square root of the unconcentrated energy. The placement 
of the sample points depends on 2WT, but of course not on the specific 
function. 

Theorem 2. Let f(t) € E(er). Then, if WT — [WT] S 3, 


k \ sin r(2Wt — &) 
(a) [; 7 2! (ap) — e(2Wt — ky 


2 





IIA 


2 
mer + Er, 








and if WT — [WT] > 3, 


pe: (" sis *) sin r(2Wt — k — 4) 
[k+3|Sw7r 2W n(2Wt—k — 2) 


2 
2 
S wer + er. 


(b) 














An estimate valid for all WT may be obtained by replacing WT in (a) 
by WT + 1. 

We note that the coefficients f(k/2W) and f(k + 4/2W) are well- 
known to be the Fourier coefficients in the sampling series expansion, 
and hence the best constants to use. 


*logt « = max (log 2, 0). 
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This theorem is, in one sense, quite satisfactory because rer + ep 
does go to 0 as the unconcentrated part of the energy ¢; goes to 0. 
On the other hand, rer + «7 approaches 0 more slowly than ¢7 itself. 
That this estimate of the degree to which sampling functions approxi- 
mate H(e,) cannot be too much improved is established in 

Theorem 10. Let f(t) € E(er). Then an estimate of the form 


.{ k& \sin r(2Wt — k) 
j rs (*) a(2Wt — k) 


cannot be valid independently of er no matter how large the constants C 
and N are chosen. 

Thus a sampling series approximation using (2WT plus a constant) 
terms will not approximate every concentrated function to a degree 
proportional to the unconcentrated energy. As we have seen, this is 
in direct contrast to the theorem previously quoted for approximation 
with the best functions y;. We also have the following negative result 
for approximation by sampling series to within (1 + n)er: 

Theorem 11. For every B < 1, there exists 6 > 0, and er such that 
f- (4) sin r(2Wt — k) 

iswrecrye \Q2V) x(2Wt — k) 
for some f € E(er). 
Once again, this is in direct contrast to the situation with the best 
functions y; as given in Theorem 4. 


We supposed near the beginning that f(t) is actually band-limited. 
Suppose that it is only almost band-limited, that is, that 


[Pe Pde 
(0.5) Meili ANS 


[. | Fw) |? de 


2 
2 
= Cer 

















2 


> (1 + der 














It is interesting that our approximation theorems are stable in the 
sense that they continue to hold approximately for approximately band- 
limited functions. A sample is the following 

Theorem 12. If f(t) € & with || f || = 1, and satisfies (0.1) and (0.5), 
then for some constants a, we have 


[2WT] 


= pa OnWn 


2 


= 12(er + nw) ae nw « 














An analogous result (Theorem 13) holds for a sampling approximation 


to f. 
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Before we proceed to the detailed exposition, let us mention one theo- 
rem, required for the proof of Theorem 10, which is of interest in its 
own right. 


Theorem 9: When restricted to t > O, the sample functions centered at 
the negative sample points are dense in £7(0, ©), but those centered at the 
positive sample points are not dense in £2(0, ©), nor even in B restricted 
tot > 0. Specifically, given any square-integrable f(t) we may find con- 
stants N and a& which make 














N : 2 
_ iv) Sin a(2Wt + n) 
[ I) — 2) an” oer ay | 
as small as desired, but there exists a band-limited g(t) for which 
x sin r(2Wt : 
gt) — 21 bs Ts ay | & 








cannot be made arbitrarily small regardless of the choice of N or b, . 
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Ill. DETAILED EXPOSITION 


1. Given N functions go, ¢1, --: , ¢w-1 in &’, let us denote by S% 
the subspace spanned by them. The quantity ont lf — dot ag: || 


of (0. 2) now represents the square of the Bree p(f, Sy), measured 
in £”, of f from S%. The number éy in (0.2) may therefore be taken to 
equal 
bv = sup p(f, Ss), 
fek(er) 
which, following the terminology of Ref. 4, we will call the deflection 
of E(er) from SQ. 

We will first identify, for given 7’ and N, that subspace of dimension 
N which best approximates H(er), in the sense of minimizing this 
deflection. 

Theorem 1: Let T be given. Then, for every N, the subspace spanned 
by the (orthonormal) functions Wo, +--+, Wr-1 best approximates (er), 
in the sense that the deflection of EK(er) from that subspace is smaller than 
from any other subspace of dimension N. 


1304 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


Proof: We first compute the deflection of E(er) from Sy. By definition, 
f(t) isin E(er) if and only if f € ®, with || f ||? = 1 and |] Df |? =1-— ¢; 
thus, expanding f in the complete orthonormal system {y,}o, if and only 
if 

f= YDaa:, with DYSla:? =1 and DA: |eP=1—-—e6. 

0 0 0 
Now by the orthonormality of the ie ; 
N—1 2 i) 7 
f = » ai a 2 | ox: | 
To find the deflection of E(er) from Sj we therefore compute 
sup o(f, St), 


p(f, Sy) = min aw; 


{a;} 























fEeE(er) 

equivalently oe Qs | ax | Boag to the conditions >>¢ | a; |? = 1 

and >°o ds | as |? = 1 — & S Xo. We find 
1 , O<1-—e& Say 
(1.1) deflection of — _ bees ; 
E(er) from Sy [ei Ga) a= | , Aw <1— 6 So 

\o — Aw 

Next suppose that go, ---,¢n-1 are any N given functions in £’. 


By the Pythagorean theorem, the distance of f € @ to any linear com- 
bination of the ¢; is no dmatles than its distance to the same linear 
combination of the functions Byg;, hence we may assume ¢g; € GB. AS 
before, let SZ be the subspace spanned by go, -:+ , ¢v-1, and denote 
by P, the operation of projecting orthogonally onto S{; explicitly, 
P,f is the element of S% closest to f. In terms of P,, the quantity of 
interest in (0.2) can therefore be written simply as 


(1.2) e(f8e) = lf — Pef I? = IFIP — Wl Pof IPs 
the last equality in (1.2) follows from the orthogonality of P,f and 
(f — Pof). 


Now assign to every f € @ the point in the x — y plane whose x and 
y coordinates are || Df ||?/|| f ||’ and [| f |’ — || Pef Il'1/Il f | respectively; 
denote by FR the set of points so obtained. The significance of this map 
is that it sends every f in H(¢7) into the line x = 1 — ¢} , with y-coordi- 
nate equaling p’(f,S;); hence we see that 


(1.3) deflection of E(er) from Sy = | sup Ji 


2 
r=1—€y, 
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By previous results,’ the x-coordinates of points in Rr satisfy 0 < « < 
Xo ; @ = Ao is achieved only by the functions kYo(t), with k any constant. 
The y-coordinates of points in Ry satisfy 0 S y S 1; y = 1 is achieved 
only by functions orthogonal to S%, equivalently to {y;}¢ ". Therefore, 
applying the Weyl-Courant lemma (Ref. 3, p. 238), we find 


2 
sup v= sup | DF | = dy. 


ee eat TP 





Since there exist infinitely-dimensional subspaces of @ over which 
| Df ||’/|| f |? is arbitrarily small (for example those spanned by Wn , 
Ynii, + for m sufficiently large), while S% is finite-dimensional, there 
are functions in those larger subspaces orthogonal to S{, and conse- 
quently inf « = 0. 

=1 


ijn 

We show next that Rr is convex, equivalently that if P, and P, 
are two points in R;, the line segment joining them is also contained 
in Ry. Let 1 be a line whose equation is av + by = c. By definition 
of Rr, afunction f € B will be sent on a point of / if and only if 


a || Df \F + ots i’ — I Pef IF _ , 
mal 


equivalently, if and only if a(Df,Df) — b(P.f,P.f) = (e — b)G,Sf), 
or, using (0.4) and the fact that f = Bf, if and only if 


(1.4) (laBDB — bP,| f,f) = (e — b) Ff). 


An operator is completely continuous’ if it transforms every bounded 
sequence (i.e. a sequence of functions {f,} for which || f, || < & with 
some i) into a sequence which possesses a subsequence converging in £° 
norm. Since B is a projection, || Bf, || < || f, || < &. Writing Bf,(t) in 
terms of its I‘ourier transform F’,,(w) we obtain 


Q2rw 
1 


Bf, (t) = on oe F,(w) e* de, 


whence Bf,(t) is an entire function of the complex variable ¢. Since 
a function and its lourier transform have the same £’ norm, Schwarz’s 
inequality applied to this representation yields 


(1.5) | Bfn(t) | = ge een 1 F,, I < eke mt 


so that the functions Bf,(t) are uniformly bounded on any compact 
set of the t-plane. Consequently (Ref. 5, p. 171), they form a normal 
family, and the sequence Bf,(t) possesses a subsequence Bf,,,(¢) con- 
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verging uniformly on any compact set of the é-plane, in particular on 
the interval |¢| < 7/2 of the real t-axis. Therefore, the functions 
DBf,,(t) converge in &’ norm as well, whence, since B is bounded, so 
do the functions BDBf,, . We have established the complete continuity 
of BDB. Since S¥ is finite-dimensional, the projection P, is completely 
continuous. By (0.4), both operators are self-adjoint. Consequently, 
the operator A = aBDB — bP,, which takes @ into itself, is also 
self-adjoint and completely continuous. Therefore’ it has a set of ortho- 
normal eigenfunctions @,(¢) € @® with corresponding eigenvalues uz , 
and every function f € @ has an expansion of the form 


(1.6) f= hp + 2, abe, 


where Ah; = 0 or, equivalently, h; is orthogonal to all the 6, . Using 
this representation, condition (1.4) becomes 


D, | ox [ux = (e — BL hy |? + DO Lee Poor 
(LQ) Di | ow ’(¢ — b — we) + (ce — b) |] hy |? = 0. 
We now argue that this set of functions is connected. For suppose that 
f =hy + DS anh and g = h, + >> 6,6, are each of the form (1.6) 
and satisfy (1.7). For every 0 S u S 1 define, for k = 0,1, --- 


vf? = +VularP + =u) [Be Penrersemere, 


h 1 — uh, 
7 mea Vl hy P+ 1 =u) The IP 





and set 
ly, = hu + oy ve? Or . 
0 


We see that Ah, = 0, since h, is a linear combination of hy and h, , 
so that 7, is of the form (1.6); it is easily seen to satisfy (1.7). But as u 
varies between 1 and 0, the functions 7, trace a connected path in ® 
between f and g. Consequently, those functions in ® which map into 
the line 7 form a connected set in ®. Since the map from ® onto Rr 
is continuous, it takes this connected set into a connected set, that is 
into a single segment of /. Thus, the intersection of Ry with any line / 
is a single segment, whence Rz is convex. 

Combined with the information already derived about the points in 
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Rr, the convexity of Rr implies that 


sup y = 1, 0O<1l—e Sy 
z=1—ey? 
(1.8) 2 
sup je se hee eee = a 
z=l~e7? No as Aw 


Combined with (1.8) and (1.1), (1.8) implies that deflection of E(e7) 
from S% = deflection of E(er) from S}. Theorem 1 is established. 

We conclude from Theorem 1 that the quantity 6y of (0.2), measur- 
ing the degree to which E(er) is N-dimensional, may be taken to be 
equal to (1.1). Since, for Ay < 1 — &, 

do — (1 — e7) er 


No — Aw ce tees Ge 


and, forAy = 1 — e7, 








we find 


€r 
1.9 iy << OS... 
(1.9) x hs 


Thus to establish an inequality of the form 6, S Cer with C independent 
of T, it is sufficient to show that d; (7) is bounded uniformly away from 
1 independently of 7. This will be done for k = [2WT7] + 1 in Lemma 2, 
and for k = [2WT] — N, provided T is sufficiently large, in Theorem 8.1. 


2. Lemma 1. Let f(s) be differentiable on (— ©, ). Then for any inte- 
gersmandn,m SS n,andanyO <5 S 1, 


n+B 
fm) ++ + 4a) = | fle) ds + G — 8) (a +8) 


n+B ; 
+ (2 a) fim—a) +] (s — [s] — 3) f (s) ds. 


Proof: The standard form of the Euler Summation Formula (Ref. 6, p. 
539) gives 


f(m) + f(m +1) +--+ + f(r) 
= [* s(s) as + y(n) + 3m) + |” (8 — bs — Hse) as. 
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Our result then follows if 


o= | sls) s+ (4 = @)s(m — a) ~ 44(m) 


+f" (tl Hse) as, 
and if 


n+p 
o= | fos) ds + - 8) s(n +8) — Hn) 


n+8 : 
+f (s— [sl — 4) FCs) ds. 


Both follow immediately by partial integration on the last integrals, 
where [s] = m — 1 and n respectively. Lemma 1 is established. 

We are now in a position to prove 

Theorem 2. Let g(t) € E(er). Then if WT — [WT] S 3, 


_ k \ sin r(2Wt — k) 
(a) | t ee (str) n(2Wt — k) 


andif WT — [WT] > 3, 


k+4\ sin e(2Wi-—-k—-—3 
b — ae 2 
“ | : eee of 2W ) n(2Wt —k —4 


2 





2 
TET + €T y 








2 
2 
S mer ter. 











An estimate valid for all WT may be obtained by replacing WT in (a) 
by WT + 1. 

Proof: Without loss of generality, we assume W = $ for convenience. 
We apply Lemma 1 with 8 = 0 and f replaced by | g |’. Then if a S 1, 


n 


jat(m) | +o + ign) = fo [ots) fas + 41 a(n) | 


+ (=a) |@(m—a)|+ f(s [sl — §) 2Relag’) ae. 
It follows that if} S a < 1, 


gm) +--+) ol sf [gee 
(2.1) ote 
= he (s — [s] — 4) 2Re(gg’) ds. 
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If0 <a <i, weset f(s) = |g'(s +4)] and oa =a +4. We have 
[g'(m + 3) [+e +] om + 4) | 


= [0 ot Dla + 4ie@ $9 
+ (2-—a@)|¢(m+4-<a)| 
+ [ (== 4) mRelols + Dale + D) as, 


or 


lo(m+4)] +--+ +)en+4), 


= [sou [aut 8] ohn + P| al am — «) | 


n+} 
ie is (w — 1 — [w — 43]) 2QRe(g(u)g'(u)) du, 
and 


n+3 
igim + ait tim — pis flew lau 
(2.2) aii a 
zoe i (u —1— [wu — 4]) 2Re(g(u)g (u)) du. 


— a 


If } < a < 1, we may apply (2.1) to | g°(t) | and | g’(—#) |, and add 
the results. We obtain 


2a Ig(k) |S / | g°(s) | ds 


mS|k\|< 
m—as|s|<sn 


+ | (s—[s]— 4) 2Re(gq’) ds. 


m—as|s|Sn 


Now | 2(s — [s] — 4) | S 1; hence 


s | tollo' las 


s4/flatas4/ fia fas. 





|| (s — [s] — 4) 2Re(gq’) ds 
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by the Schwarz inequality. But 


[ leta<f te Pas 


m—as|s|Sn 


25) Wiens 


The last inequality holds because’ - 


g(t) 2 ie G(x) e*’ dx 


g (t) 


i ixG(x) e*' dx 


[ido dt = 2" | 2° G(n) |? dx 


IIA 


ona’ | | G(x) |’ dz 


ef |g) Pat 


Hence 
> ivtis f[ les)las+e| [las las]. 
ms |k|Sn m—-as|s|Sn m—as|s|sn 


Now let n — ©; the preceding equation becomes 


os | g (k) | = eis) + 1€2(m—a) - 
ms |k| 
Furthermore, 


_+ _\ Sin r(t — k) 
g(t) = 2X g(k) me ay a 


and the functions sin r(t — k)/r(t — k) are orthonormal. Hence, 
2 sin x(t — k) || 
Dy 9 (k) =\9(t) -— 2D g(k) —S—— 
mS |h| Ik] < m a(t — k) 
If we now set m = [7/2] + land m — a = T/2, thena = $ if 


T/2 — [T/2\ & 7, 








and we obtain (a). 
Exactly the same argument, based on (2.2) rather than (2.1), gives 
the result (b) for the case 77/2 — [7/2] > 3. 
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If in (a), we use for [7'/2] the integer m + 1, then 


[aw - yo gt © 


2 
< riven + 
= +1} T€2(m+1) 
[| Smt a(t — k) | 





and, since eg is monotone decreasing in a@, the last statement of Theorem 
2 follows. 

Corollary 2.1. Let g(t) € E(er), and let W = 3 for simplicity. If, in 
addition, g(k) = 0, |k| S T/2, when T/2 — [T/2] S 4, orf g(k +4) = 
0,|k +4{| Ss T/2, when T/2 — [T/2] > 3, then 


| Dg | S wer. 


Proof: This follows immediately from substitution into Theorem 2(a) 
and (b) of the additional conditions on g(t). 

Notice that the number of points at which g is required to vanish is 
[T] + 1, except if 7/2 — [7/2] = 4, when it is one less. 

Lemma 2. With the normalization of W = 4, for any T > O 


Arrigi(7") S 0.915. 


Proof: Let us consider a function of the form 
(r]+1 


(2.3) f= a AnWn(t). 


The series contains [7'] + 2 coefficients to be determined; it is therefore 
possible to make f vanish at the (at most) [7] + 1 integer or half- 
integer points a, of Corollary 2.1 without having f vanish identically. 
More precisely, we wish 


[r]+1 
> awl) =0, k=0,1,---, [7]. 
The rank of the matrix {y,(az)}, n = 0,1, --- [T] +1,k =0,--- , [7], 


is at most [7], and hence there exists a solution vector {a,} not all of 
whose elements vanish. We may then pick the a, so that >> | a, |? = 1. 
We have thus found a function of the form (2.3) and of total energy 
one, which vanishes at the [7] + 1 points of the Corollary 2.1. 

We know for this function that 


5 [7]+1 ‘ 
fo isfae= Xo lates, 


le] S7/2 


{7]+1 
| jjfaeas > pays 


0 





| 


jél>7/2 
1 


(1 — An) | an |*. 


(7 


+ 


T 
M 
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. . ° . 9 
Since the A, are decreasing in n, we have, remembering >> | a, |? = 1, 


[7T]+1 


Arr}41 s yy | Gn ee 


0 





{7]4+1 


a > (1 — An) | on |’ by Corollary 2.1, 
0 


mrV/1 =: Atr}41 . 


Therefore A,741 is bounded from 1, and is, in fact, no larger than the 
root of the equation 


IA 


IIA 


r= rl = a 
which is 


—r t/t tar 7 
5 = 0.915. 


_ 





Lemma 2 is established. 
3. Theorem 3. Let f(t) € E(er). Then 


| (2wT]+1 2 





< 126 


4 
= T >) 


: i 2 AnWn 





where the a, are the Fourier coefficients of f in its expansion tn the functions 
Wr - 

Proof: The quantity defined in the theorem represents the square of 
the distance from f € E(er) to the subspace S?"7!™ spanned by the 
functions y, , with 0 S$ n S [2WT)]. Thus, by definition, it does not 





exceed Sfowrj4i, the square of the deflection of E(er) from se a 
Combining (1.9) and Lemma 2 now yields 
[2WT] 2 e e 
3 awn < < z = 126). 
I x Me | ae hema Le 001G 











Theorem 3 is established. 
4, Theorem 4 (Shannon). Given any n > 0, there exist constants 

Cz = C3(y) and Cs = C4(n) so that for f € E(er), 

[2WT]+Czlogt2wT+c, 2 


i 2 aw: || S (1+ ner. 











inf 
a; 


Proof: Using properties (72) and (ziz) of the eigenfunctions y;, and 
known results (Ref. 3, p. 242), we obtain 


pa(t — 5) = SRE = pels vid 
T s) 0 
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Therefore 


oi(0) = > veo, and 


T/2 eo T/2 cd 
41) f  pa@a=2wr=d[ vta@a= dx. 
ap 0 re 0 
We now proceed to estimate >>> \7. The functions y; satisfy the 
integral equation 


T/2 
roid) = | vals)o(t — 8) ds 
-7T/2 
Then 
st ae Pid: pr ]2 
Ar | vi (t) dt = | i pi(t — s)Wils)¥i(t) ds dt, 
—T]2 —-T/2 4-T]2 
and if we sum on 7, we obtain 


0 T/2 T/2 
i = i or (t — s) ds dt. 
=O — — T {2 


T/2 


v 


We now set 
s = 28/T, t = 2t/T, ¢ = «WT, and p(u) = sin cu/(ru). 


Then 
1 1 
i / ou! — 8’) ds’ dt’ 
Ly J 


1 1—s’ 
| ds’ | p (u) du. 
—J js’ 


Integration by parts, and the substitution cu = x, give 


20 Qe + 2 2c + 2 

2 4c sin” & 2 sin’ & 
DAY = 4 dx — = dx. 
i=0 0 


9 
rT? 


oe) 
2 
Xi 
0 


v 


I 








he TT” “0 x 
Asymptotically for large c, this is easily seen to equal 


2c 1 
cir nara 1 
a log c + 0(1), 
but we desire an actual lower bound. For c = 7/8, 


- 2 387r/4 + 2 2 :_ 2 
ng §«€68e—Stsé4ee ff Sin’ 2 Q [374 sin? x 2 f° sin’ x 
ye eee ae es ge ae 
i=0 T 0 TW” 3 


Te Joo 2? Tr x r/4 2 








Therefore, 
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< 2c 4c [f° ad ae gos of 
ie Sea) ae ede — = Gd cone 
i=0 vis via 2e Tw 9 TW" '3r/4 xv 
cr ww 16 # 3/4 
since 
i cos 2x dx S56 ah oS als 
8r/4 xv 
Thus 
(4.2) i Se eed 
0 Tv Tv 


for all c, since the inequality is trivially true for c < 7/8. 
Let us now introduce the following combinatorial problem. We con- 
sider infinite sequences of non-negative numbers yu; such that 


(aj l2uwzme2a ree, 
(b) >> u; = A, a given positive constant, 
0 


(c) for a given integer m = A, um has a prescribed value, 


and we seek to maximize Da u; over all such sequences. Clearly the 
optimum {y,} will have un; = 0 if 7 > m. 

We claim that, with the possible exception of one u;, all the others 
in the optimum solution equal either 1 or u,, . or suppose they do not, 
i.e. suppose {u,} takes on two values a and @ such that un <a <6 <1. 
If we now vary a and 8 between the limits yz, and 1, keeping a + 8 
a constant, and maximize a’ + 6°, we find an end-point maximum. 
In detail, if a + 8 = s, thena’ + 6” = 2 [(a — s/2)’ + s°/4], which 
is maximized at an end-point value of a. Thus, the maximizing sequence 
{un}’o can contain only one value which is neither 1 nor u,,. This odd 
value is due to “breakage” in obtaining the exact total A. Let the 
maximizing sequence have k “1's”, (m — k) “un’s’”? and one value 
Q, bm <a < 1. Then 


k+(m—k) um ta=A, 
so that 


p - ATT Mm 
1 — Un 


PROLATE SPHEROIDAL WAVE FUNCTIONS — III 1315 


Then 





Se eh A ae 
0 


(4.3) ao Lm eo Lm 


(A —e a) (1 a ine) — Min ae a, 

This is the maximum achievable value of }°f uj under the conditions 
(a), (b) and (c) above. But with A = 2c/z, and \p given, m 2 2c/z, 
the sequence of eigenvalues ); satisfies the above conditions. It therefore 
competes for the maximum, and hence 


2 2Q¢ 
Loge y= 25%; <=(1 + Xm) = MAn. 


Tv Tv 0 


Thus, for any m = 2c/z, 








+ 
oe c 14 
\n S— 
sar 2c 
m—-— 
Tv 
Tor any given 7 > 0, if 
9 
(4.4) meeps : 2 (8 +1) 


it follows that \», S 7/12. Then, by the reasoning of Theorem 3, 


er” 


a5 Bae a 
f= 2) anibn || =>: 








inf 
If 7 S 11, this implies 
f- > Onn || SL + ayer; 


larger values of y are covered by Theorem 3. Theorem 4 is proved. 

Note: If only small values of 7 are of interest, the ‘‘12” in (4.4) is of 
course unnecessarily large. 

Lemma 3: With the normalization of W = 3, we have for any T > 1, 


Arrji-a (T') = 0.085. 


Proof: We begin, again, with Lemma 1. If we consider first the case 
T/2 — (T/2| = 4, we let f(s) = |g°(s) |, and 


1 ih ff T 
pep patt e n= [5], e-5-|F |; 


b) m= -|P | —1, a= -f-(f 








inf 
a; 
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We obtain 


7/2 


it --le Uris [ las) las 


ey 
2 


T/2 


+ [ (s= [sl — ¥) 2Re(gg’) as 


2 


and 


La(-(E/) | +--+ +190) 1s f | G(s) [as 


~—T/2 


F / (s — [s] — 4) 2Re(gq’) ds. 


—T/2 


Adding and applying the Schwarz inequality, we find, as in Theorem 2, 
(4.5) > |g(m) | S Dg IP + x [1 Doli iig I. 


In} <[7/2} 
The Weyl-Courant lemma (Ref. 3, p. 238) asserts that 
|| De |)’ 


An = inf sup a 
An P44n {| © || 





where A, ranges over all H-dimensionnl subspaces of ®. If B41 is an 
(n + 1)-dimensional subspace of @, the orthogonal complement of 
every A, must have at least one vector in common with B,4,. Thus 

2 


via, Ol wena lel * 


2 








and since the right-hand side of the inequality is independent of A, , 
the Weyl-Courant lemma implies 





2 
(4.6) oe ae ee 
veauy lle lP 


Now let B;r be the subspace of ® spanned by the [7] (orthonormal) 
functions [sin r(¢ — k)]/r(t — k), | k| S [7/2]. For g € Bir we have 


lgIP= lal) &, 


InlS{7T/ 


since g(n) = 0 when | n| > [7/2], so that (4.5) yields 
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2 
1g HDol, Pel) 
Il g I? II g || 
Letting g vary in B;7r , and using (4.6), we now find 
2 2 
tg inp WPelhy Del inp Wel yy gg Dol 
seziry lg | lg || seziry |g | ceniry lg |i 
SNnatdt V dry ) 
whence 
Arr} 2 0.085. 


Similarly, if 7/2 — [T/2] < 3, 
[gin +4) P S| Do lP +7 [Doll loll, 


IntaYStr/2) 


and letting Bir, be the subspace of @® spanned by the [7] functions 
[sin r(t — k — 4)]/[r(t — k — 4)] with |& + 3] S [7/2] we may 
apply the identical argument to find A;7j}1 2 0.085, as before. Lemma 3 
is established. 

Lemma 4: For any WT > 0, 


Mo > 1-2/2”: 


Proof: Yor convenience, let 2 = 2rJV, and normalize so that T = 2. 
Consider the function f(t) whose Fourier Transform F(x) is given by 


1 —xr2/2Q . 

— + ¢ ye! =o 

F(x) = aa i 
\O if || > a. 

Then 


[ise dt = ae [ Ce) dx aia [Yo an 


On the other hand, 


fi) = f F(x) cos xt dx 


2 aa) —{¢2 = —72/22 
Ck _ mQ —129/2 | 22/2 Po) 
(4.7) al 4/% € ‘ é cos xt dt 


2 QO —t2Q/2 i —x2/209 
= — — dx |. 
~ (rQ) | 5° 2 : 





V 





~ 
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It is easy to check that the expression in brackets is non-negative for 
¢ = 1, and hence for | ¢| < 1. In fact, 


- —12/22 pa ‘ —u2/2 
€ dx = Q / € du 
i BVA ae , 


a 7 —2/2 __ er 
G(Q) = /2 e ee € du 


is non-negative since it equals 0 at both Q = 0OandQ = o, and 


G'(a) = 8" be 2 va) 


is positive for Q < 2/z and negative for Q > 2/x. Thus 

1 Jd pe J 
l P(t) di > 4 vz | c™ du — 8/2 i eee au | et? dy, 
= ‘ V2 0 


Hence, from (4.7), 


[foe eo) oe . ie ff re a ~u2/2 an 
[roa Wi, | [ae aly uf 


i ee du < V3 e ? because G(Q) = 0 
Jt 2 
and the expression in parentheses is bounded by +/2. Thus 
1 
[ roa 
1 
[ P(t) dt 


But f € @ by definition, and hence competes in the maximum problem’ 
which defines \> . Hence 


1 
[ | f(t) | de 
No = max ———_—_= > ] = 2/2 6 P?, 


® f lsmPat 


and 


But 


>1—2+/f2e°"", 


Lemma 4 is established. 
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5. Theorem 5. For any ep < 0.915, there exists a function f € E(er) 
such that 

















. aoe : 1 2 = WT/2 
inf — wv: || = —2 a : 
oe f - aw = 0915 (er V/2¢ ) 
(If e& = 0.9165, the right-hand side of the inequality should be replaced 


by 1.) 

Proof: Theorem 5 asserts the existence of a lower bound for the deflec- 
tion of E(er) from the subspace S¥’"7!-’, spanned by the functions 
y, , withO < k s [2WT] — 2. This deflection has already been calculated 
in (1.1) and is easily seen to be assumed by a function in E(e7). Thus 
there exists f € H(er) such that 


= min E ewe ama) — (i = | 


: No a Afawr]—1 


(2Wwr]—2 
inf ||/f—-— >> aw: 
0 


a; 














2 


By Lemma 3, Apwrj-1 2 0.085; this ensures that when e7 = 0.915 
the smaller of the terms is 1. For other values of «> , since 


b> 1 — fee"? 
by Lemma 4, and Ay < 1, we find 


P AN -— 1 + ) ; | 1 2 ~ —rWT/2 
jo |S fies (a 
min ( [ean rere min O15 Ce 1/2 Ale 
and now the second of the bracketed terms is the smaller. Theorem 5 
is established. 

6. In £°(—~«,«) let D’ denote the operation of projecting onto 
[0,0], that is 


ft) 420 
re lo t<0 


Arguing as with DBD in the proof of Theorem 1 we see that D’BD’, 
which takes £7(0,< ) into itself, is self-adjoint, positive, and bounded 
by 1 (though no longer completely continuous). It therefore has a 
spectrum’ contained in the unit interval; we will show that its spectrum 
consists of allO S$ A S 1. 

Theorem 6. The & spectrum of the operator 


sin (x 
x — 


DBD f= i Gy SG, 
Tw “0 Yy 
consists of allO SX S 1. 


Proof: Theorem 6 follows immediately as a special case of much more 
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general results of H. Widom’ and M. Rosenblum (unpublished), which 
determine the spectra in £° of Wiener-Hopf equations with kernels 
whose Fourier transforms are bounded. We include a separate proof 
only because it is constructive. 

By definition, \ is in the spectrum of an operator A if and only if 
for every « > O there exists ¢, such that 


(6.1) | Ag =~ Age | 
| Pe | 

We will prove the theorem by constructing functions which satisfy 
(6.1) for any given 0 < »X < 1. The spectrum being a closed set, it 
must then include all 0 $ \ S 1, but by the introductory remarks it is 
also contained in the closed unit interval, hence it consists of precisely 
the points 0 S A S 1. 

Lemma 6: Let uw > O be given. Then corresponding to any 6 > O there 
exists a function H;(z) satisfying 


<6. 


a. H;(z)is analytic in |z| < 1, continuous in |z| S 1, 
b. H;(0) = 0, 


J \utt(e®) + Hse) ff sin 0 as 
i ets ea ees 
j | Hs(e”) |? sin 6 do 

0 


Proof: Suppose 0 < a < 7/2. Denote in the z-plane by P, ,P2 ,P3 ,P4, 
P;,Ps5 the points 1,e’*,—e ‘*,—1,0, and 7 respectively, and let 1,72 
represent respectively the arcs P,P. ,P3P, of the unit circle. Let w = 
P,(z) be a conformal map of the upper half of the unit dise onto the 
region in the w-plane defined by 1 < |w| <q < ~», Im{w} > 0, which 
takes the points P; ,P2,P3,P,, ontow = 1,w = q,w = —q,w = —1 
respectively. The required map exists as soon as qg is chosen appropriately 
(for example, so as to make the extremal length of the family of curves 
joining 7: to ye in the upper semicircle equal to the extremal length of 
the family of curves joining the two segments of the real axis in the 
image domain), and it defines g uniquely. Now by reflection, Po(z) is 
extendable across the diameter of the unit circle to a map of |z| < 1 
onto the domain 1/q¢ < | w| < q, Im{w} > 0, and satisfies 


P,(e")/¢ = Pye”) ax@<7r-—a 
| Po(e”) | = q ax@<7Tr-a 
1/q < | Pole”) |, | Pole”) | < ¢ ev EN, ye. 
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Choose r so that p» = gq”, and let P(z) = [g¢Po(z)]". Since Po(z) is 
bounded away from zero and infinity in |z| S 1, the function P(z) 
is also analytic in | z| < 1, continuous in |z| < 1 (though no longer 
necessarily univalent) and satisfies 


uP(e”) = P(e”), ax<0<7-—a 
| P(e”) | = 1/n, a<@0<7-a 
my, = min (1/u,1) < | P(e”) |, | P(e”) | < max (1/u,1) = My, 
ev € 41,72. 


Next let w = Q(z) map the region defined by |z| < 1, Im{z} > 0, 
Ref{z} > 0 onto itself, taking the points P; ,P; ,P2 onto w = 0, w = 1, 
and w = 7 respectively. Q(z) may be constructed from elementary maps 
and is given explicitly by 


4/ cos a+ (5 as =) sinka — 1 
Q(z) a ee Ne 


2\ 2 : 
4/ costa + (+5) sink? a + 1 


It may be extended by reflection to yield a map of |z| < 1 onto the 
domain in the w-plane formed by cutting the unit circle along the 
imaginary axis from z[sin a/(1 + cos a)| to 7 and from —;[sin a/(1 + 
cos a)| to —7z. It satisfies 


Q(0) = 0, 
Q(e") = —Q(e"), a<@0<7Tr-ea 
[Qce")| = |Q(E")(=1, een m. 


Now form H(z) = P(z)Q(z). We see that H(z) satisfies conditions 
(a) and (b) of Lemma 5. Furthermore, by definition of H, 











uH(e") + H(e”) = 0, a<@0<r-a 
| He") | == | Qe") | ee ee 


m <|H(e") |, |H(e™) |< Mi, ee Ens. 


Thus 
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je | wH(e”) + H(e~”) |’ sin 6 do 


(6.2) I -. iz | uH(e”) + H(e*) le sin 6 d@ 


IIA 


2(u + 1)°M : sin 6d0 = 2(u + 1)°M2(1 — cosa). 
0 


[ |e) Psinao> [| He) P sino ao 
(6.3) ° 
1 aoe 70 2: 2 aus 70 25 
= 5 | Qe") | sin 6 da = — | QCe”) |° sin 6 dé. 
ML’ va@ MN va 


Using the expression for Q(z) we find, fora < @< 7— a, 


cos? @ sin? a sin? @ 
cos? a — —~~—— — l 1-- —1 
/ sin? @ sin? 6 


70\ |2 
é = SEE |S SSS 
poles a COs PSIG at 4 4) Sima y 
a sin? @ / sin? @ 


sin? a |” 
1-4/1-"l 2 
sin? 6 


Introducing this into (6.8) yields 











2 
_ sin’ 6 
sin? @ 

















1 + cos “) 


v : 1 
H w0\ 2 5 ws 2 ( : 
[ | H(e") |" sin 6 dé > 5a log ane 


whence, by (6.2), 


/ | wH(e”) + H(e~”) |? sin @ do 
F K, 


: > log esc a’ 
i | H(e”) |’ sin 6 de = 
0 


where K, depends only on yu. Thus, if a is chosen sufficiently small, 
H(z) satisfies the remaining condition (c). Lemma 5 is established. 

We now pass to the construction of the functions ¢, of (6.1). Given 
0<A <lande > 0, set 0 < uw = (1 — XA)/A, choose 6 so small that 
that V6/(1 — VW/5) < «, and let H;(z) = H(z) be the function of 
Lemma 5 corresponding to 6. 

Introduce the map u + w = w = 3(z + 1/z), taking | z2| < 1 onto 
the w-plane slit along the real axis from w = —1 to w = 1, and in 
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that region define F(w) = H(z). The function F(w) is then analytic 
except on the slit. If /\(w) denotes F(w) in the upper half-plane, 
F,(w) is continuous in the closed half plane v = 0. If v > 0, 


| 


where I, is the curve in the upper half of the unit circle defined in 
polar coordinates by (r — 1/r) sin 6 = 2v, Since H(0) = 0, the func- 
tion [H(z)/alV/1 — zis analytic in |z| < 1, continuous in |z|] < 1, 
and by the maximum principle 


UO) Vie 


By property (a) of Lemma 5, H(z) is bounded in |z| S 1, hence so 
is the integrand on the right-hand side of (6.4). Since the curves I, 
have lengths bounded independently of v, it follows that 





(64) [| Fut a) Pau = 5] | H(z) 





Dae 


< sup 
= Zz 


fz|=1 














< sup | H(z) |. 
j2{=1 


| |Fi(u+ iw) Pdu<c, v>Q0. 


Consequently, by a theorem of Paley-Wiener,’ F,(w) coincides in 
v = O with the Fourier transform of a function y; (4) € & which van- 
ishes for t = 0. Letting F.:(w) denote F(w) in the lower half plane, 
the identical argument establishes that F.(w) coincides in v < 0 with 
the Fourier transform of a function y(t) € £&’ which vanishes for 
t $0. Let oi(t) = (1/\) Wilt), g(t) = —C1/r)¥a(t), and g(t) = g(t) + 
go(t). Then with x(w) the characteristic function of the interval —1 < 
wu < 1, using the norm-preserving property of the Fourier transform, we 
have 


|| BD'p — re |? = || (B — ex — Avi Il’ 


Pi) F,(u) | 


= [7] oxtuy — yw 
(6.5) =f Lele) — Fay fae 
+ | \aPs(u) + Pu) Paw 


= / |ul(e”) + He”) | sin @ da, 
0 
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while 
lel = lel? = Sllvel? 
(66) = [| Polw) Paw > 5 [| Pal Pau 


- s | H(e) sin 0 ds, 
0 
Thus, combining (6.5) and (6.6), 


‘ ‘9 ~i9) 2 
| BD'e — ro | f |pH(e") + H(e™) | sin 6 dé 


(6.7) “Tek <A - < n’6. 
4 i | H(e”) lh sin 6 dé 

Trom (6.7), 

(6.8) || D’g || = || BD’e || = A — V6) Ile ll, 

and 

(6.9) || D'BD’g — De || = || D'(BD'g — dg) || S Ve Ie |. 


Setting ¢, = D’g and combining (6.8) and (6.9) we obtain 
|| D'BD'g. — dye I/II ¢ || S V8/Cl — V8) <6, 


which is the required inequality (6.1). Theorem 6 is established. 

7. Theorem 7. Given any subintervalO <a Sx S B < 1 of the unit 
interval, there exists Ty such that for all T > Ty, the operator BD7B 
has an eigenvalue contained in [a,8}. 

Proof: Let } = $(a + 8) and choose ¢ so small that 3¢/(A — 3€) < 
(8 — a)/2. Since 0 < dX < 1, by Theorem 6 there exists a function 
ye € & such that 


|| D'BD'e — de || 
Il ¢ Il 


Since g and D’BD’¢ are fixed functions in £’, there exists 7’) such that 
for each T > T% 


<eé 


| (D - Diet = [leo Far <éllel? 
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and 
| (D' - Dr)BD'e |t = | | BD) Pa <é lel? 


Using the inequality || DrB(D’ — Dr)g || S || (D’ — Dr)g || we then 
find 


|| DrBDrg — d¢ || 
ll ¢ ll 

_ || D'BD'g — Xv — (D' — Dr)BD'g — DrB(D' — Dr) | 

7 ll 2 || 
(7.1) < || DBD'g — de || , || (D = Dr) BD | 

~ ll ¢ II ll ¢ {I : 

1 DrB(D' — Dr)¢ || 
ll ¢ || 

< 3e. 
Now from (7.1) we see 
(7.2) || Dr ¢ || 2 || DrBDr ¢ || 2 (A — 86) le || 
and 
(7.3) || DrBDr g — Dr ¢ || = || Dr(DrBDr ¢ — dg) || S 3ell¢ || 
so that, combining (7.2) and (7.3), 
(7.4) || DrBDr ¢ — Dr ¢|\/|| Dr || S 3€/(A — 36). 


Now by property 7 of the functions y; , we may expand D, ¢ in a series 
Dre = Yi Angn , Where on = Dr pn/+/dn(T'). Inserting this into (7.4), 
and using the fact (77) that the y,(t) (which depend also on 7) are 
eigenfunctions of BD;B, we find 

( 3¢ y , || DrBDre —Dre|l _ || De awOn(P) — Aen |! 

h — 3e || Dr ¢ |? | do anen |? 
_Slaalt (2) — al 
de | aa |? 


> inf |\,(7) — Al’. 


We conclude that for every 7 > 7) there exists an eigenvalue X,(7') 
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of the operator BD 7B with.|\n(7) — | S 3e/(A — 3€) < (6B — @)/2, 
or equivalently, since the ,(7') are all real, that a < X(T) < 8B. 
Theorem 7 is established. 
Corollary 7.1 The number of eigenvalues of the operator BD 7B contained 
in any subinterval J of the unit interval cannot remain bounded as T— ~. 
Proof: Given any integer N, subdivide J into N disjoint intervals J, . 
By Theorem 7, for all 7 sufficiently large each J, will contain an eigen- 
value of BD;B, hence J will contain at least N such eigenvalues. Since 
N was arbitrary, Corollary 7.1 is established. 

8. Theorem 8. Let any integer N and ep < 0.916 be given. Then as 
soon as WT is sufficiently large, there will exist a function f € K(er) 
such, that 


{2WT]+4+N 


I= x aw; 





2 
* 1 2 —nxWT/2 
> = 
aH 2 0.91 (Ep 2/2e +m 


(If & = 0.916, the right-hand side of the inequality should be replaced 
by 1.) 
Proof: By Lemma 3, we have 


Apwra (2WT) = 0.085. 














By Corollary 7.1, there exists a constant ko), depending only on JN, 
such that for all WT > ko the interval 0.084 < x S 0.085 will contain 
at least V.+ 2 eigenvalues of BD7B. Hence 


Aw 7]4+N41 (2WT) = 0.084. for WT > ko . 


Now the proof of Theorem 5, applied without change to Apwr+yni, 
establishes Theorem 8. 

Theorem 8.1 Let er and any integer N be given. Then as soon as T ts 
sufficiently large 
[2WT]—N 2 


,= a aw; 


= 12€ 


inf 


a; 


for all f € E(er). 
Proof: According to Lemma 2, 


Apwr4t (2WT) S 0.915. 














2 
T 


By Corollary 7.1, there exists a constant k,, depending only on JN, 
such that for all WT > k, the interval 0.915 S x S 0.916 will contain 
at least N + 1 eigenvalues of BD7B. Hence, 


Mewn_w (2WT) < 0.916 for WT > ky. 
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Applying. now the proof of Theorem 3 to \pewrj—-w (2WT) establishes 
Theorem 8.1. 
9. Theorem 9. A. The restrictions to t > O of the functions 


[sin r(2Wt — n)]/(2Wt — n), 


for n < —1, are dense in £(0,@). 
B. Their restrictions to t < 0 are not dense in &(— © ,0), nor even 

in B restricted tot < 0. 

Proof: Without loss of generality we may take W = 3, to simplify 


notation. We begin with part A. Let 


| Tl, (t20 
c(t) = 
Oy. Oe 


The functions 


sin r (t — n) 
t—-n 


gr(t) = c(t) 


all lie in £7(0,«), so that their being dense in £°(0,) is equivalent 
to the statement that h(t) = 0 is the only function in £7(0, ) which 
is orthogonal to ¢n(t), n S —1 (Ref. 3, p. 72). We will prove A in 
this form. 

Accordingly, suppose that (h(t),¢.(t)) = 0,n S —1. Using the Parse- 
val theorem, and letting x(w) be the characteristic function of the in- 
terval |u| S 7, we find 


0 = (ald,en(0)) = | oa(, BZE—™) | 
mt inu 
ey = (H(u) x(ue™) 
= (x(u)H(u),e), n s =|), 
where H(u) is the inverse Fourier transform of c(t)h(t). The function 
x(u)H(u) is in £’(—z,r) and may therefore be expanded there in a 


Fourier series x(u)H(u) =4 cr» axe”. By (9.1) the coefficients a, 
vanish for k < —1, so that 


(9.2) x(u)H(w) = Do ave’; 
also 
(9.3) Slak=2f [HW Pau < @, 
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The function H(u) may be continued analytically into the upper half 
of the w = wu + vw plane by its defining formula 


H(w) = i h(x) e*” dx , 
0 
from which, by Parseval’s theorem, 


(9.4) [ [Hut i) Pau<a<e, v 


IV 


0. 


Set G(w) = deo axe*”; the function G(w) is then also analytic in the 
upper half-plane v > 0, and is periodic there, with period 27. We will 
now show that (9.2) implies H(w) = G(w) for v > 0, consequently 
that H(w) is also periodic in v > 0 with period 27. It then follows from 
(9.4) that H(w) = 0, hence that h(2) = 0, which was to be proved. 
We model our argument on one given by A. Beurling (unpublished). 

Applying the Schwarz inequality to the defining expressions for H(w) 
and G(w) we find 


(9.5) |H(u+ w)|, |G(u + w)| S k/V, 0 <0 < 2. 


Next set F(w) = H(w) — G(w) inv > 0. 

Let 0 < e€ < 4%, and in the w-plane denote by P, ,P. ,P2 ,Q2 ,Q. Qi 
the points 7,7 + ter + 1,-m7 + 1, —a + te,—7 respectively. Let I',T. 
be the arcs made up of the line segments P,P. + P2Q. + Q.Q; and 
P,P. + PP. + QQ: respectively. Let Ri, R; be the rectangles |u| < 
a/2,4 <v <  jand |u| < 7/2, —} < v < —} respectively, and R a 
region which contains 2; and R, and whose closure does not intersect I. 

Form the function 


Tw) = { Fa 


r¢t—w- 





By (9.5), F(¢) is integrable on I, so that J(w) is an analytic function of 
w for w off T, in particular for w € R. Now we rewrite 


PO) dt, f PW at 
rr, ( —w r¢—w 
and estimate the second integral of (9.6). If w € R, U R, and ¢ € I. 
we see that 1/|¢ — w| < B < . Consequently 


F(s) dg -f P(g) do yf" Plu + te) an| 


r, { ie) 1P.4+9.Q1 > -—w 7 U + te — WwW 


(9.6) J(w) = 





(9.7) 


IIA 


Bf |r a+ Bf | Pu tee) |du. 
P1P.+Q¢Q1 Ft 
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By virtue of (9.5), 


(9.8) lim | | F(¢) de] = 0. 


e>0 “Pi P.4+0.Q91 


Applying the Schwarz inequality to the remaining integral of (9.7), and 
using the definition of F and the triangle inequality in £? we find 


1 T T 3 
Fal | F(u + ie) jaws {f | F(u + ie) Pau} 


< (Pi imc + ie) — H(u) F au 
(9.9) : 
+ iL 100 — G(u) P du} 
i if Gen sae in) 
By definition of H(w), 
H(u + te) — H(u) = ia h(t)[e""* — le dt, 


whence by Parseval’s theorem 


a | H(u+ ie) — H(u) | du Ss [ \am+ io — H(u) du 


(9.10) 20 
x an | ia(t) Blew —1 [dt 


For each ¢, lim | h(t) |? | 1 — e“‘|* = 0, and | A(t) |? |1 —e "|? s 
e>0 
4| h(t) |, which by assumption is an integrable function. Consequently 
by the fheokew on dominated convergence (Ref. 3, P. oe) applied to 
the last integral of (9.10), 
(9.11) lim [| W(u+ ie) — H(u) du = 0. 
€>0 —T 
Similarly from the definition of G(w) 
G(u) — Gu + te) = Da (1 — &“*)e™, 
k=0 
whence 


f |G(u) — G(u + ie) PP du = 2n Do la Pll — & * /, 
—T k=0 
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so that using (9.8) and arguing as above 
(9.12) lim [| @(u) — @(w+ ie) du = 0. 

' e>0 —T 


Combining (9.12), (9.11), (9.2), (9.9), o 8) and (9.7) we find that 
uniformly for w € R, U R2, 


(9.13) lim 


es 0t 





[ F(g) dg} _ 


f-w 





Since T — I forms the boundary of the rectangle |u| S 7,« Sv <1, 
in whose interior F is an analytic function, the first integral on the right- 
hand side of (9.6) is equal to F(w) for w € R, and to 0 for w € Rp. 
Irom (9.18) it follows that J(w), which is independent of ¢, must itself 
coincide with ’(w) for w € R; and with O for w € R.. But if J(w) = 0 
in R, , it must be identically 0 in its whole domain of analyticity, in par- 
ticular in R, hence also in R; . We conclude that F(w) = 0 in R,, hence 
in its whole domain of analyticity v > 0. Thus H(w) = G(w) inv > 0, 
whence, as we have already argued, part A of Theorem 9 follows. 

We now pass to a proof of part B. We remark first that the restric- 
tions of ®@ to t < 0 include the functions [sin r(2Wt — n)]/(2Wt — n), 
n = 1, restricted to t < 0. Replacing ¢ by —#, we see that, by part A, 
these are already dense in £°( — © ,0). Consequently to prove part B it is 
enough to establish its first assertion. 

We argue by contradiction. Accordingly, suppose that the restrictions 
tot < 0 of the functions [sin r(2Wt — n)]/(2Wt — n), forn S —1, are 
dense in £°(— ©,0). Then defining the function g(t)«£(— ©,0) by 


* 1, -lstso 
t) = 
0, i <=1, 


we could find a sequence of functions f,(¢), each some linear combination 
of the [sin r(2Wt — n)j/(2Wt — n), n S —1, such that {f,(¢)} ap- 
proaches g(t) in £°(— 0,0), i.e. such that 


(9.14) [lo - £0 Pat = 4 0. 
The triangle inequality in £°(— ©,0) applied to (9.14) yields 
(9.15) a | f(t) dt s Lf | g(t) Pat] is Va} = (1+ Ve)’. 


Now the functions f,(¢) are all band-limited and f,(k) = O fork = 0. 
Thus by Ref. 9 there exists a constant C, such that 
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(9.16) [incras ce | ico Pat 


From (9.15) and (9.16) it follows that, as elements of £°(— ©, ), the 
functions f,(¢) have uniformly bounded norms as soon as én < 1. Ap- 
plying (1.5), we conclude that the f,(t) are a uniformly bounded family 
of analytic functions in the strip | Im{¢} | < 1 of the complex t-plane, 
thus a normal family there (Ref. 5, p. 171). We may therefore extract 
from the sequence {f,,(¢)} a subsequence f,,(¢) converging (pointwise) 
in the whole strip, uniformly on any compact subset of the strip, to an 
analytic function f(t); from (9.14), 


ft) =g(), #< 90. 
But g(t) vanishes on an interval without vanishing identically, and so 
cannot coincide with an analytic function. We have reached a contradic- 
tion, and part B follows. Theorem 9 is established. 
10. Theorem 10: Let f(t) € E(er).:-Then an estimate of the form 


ee F sin r(2Wt — k) q 
{ax} uicwrin  e(2Wt — ke) 


cannot be valid independently of er, no matter how large the constants C 
and N are chosen. 
Proof: Without loss of generality we may take W = 3, to simplify no- 
tation. 

Any function f € @ has the (sampling series) expansion f(t) = 
>? f(k) [sin x(t — )\/[r(t — k)]. Since the functions 


2 
= Cer 














sin r(t — k) 
w(t — k) 


g(t) = are orthonormal, 


(10.1) 


2 


= Deh) 


kl >(7/2)4-N 











f — 1 Ok Gk 
IkI<S(7/2)4N 


min 

{ax} 

Now consider the function [sin r(¢ — N — 1)]/[x(t — N — 1)] which 

is in ®. By Theorem 9, we may approximate its restriction to ¢ > 0 ar- 

bitrarily closely in £°(0,0) by finite linear combinations of the func- 

tions [sin r(é — n)]/[r(t — n)], n S —1. That is, given 7 > 0, there 
exists constants a_4,...,d_m (depending on 7) such that 


(10.2) I : 


Let 





sin r(t — N — 1) = so sin r(t — &) | 
a(t — N — 1) ~ 2a, % a(t — k) 





dt < ». 


sin *(¢— N— 1) _ sin r(t — k) | 


09) * LOS GN ty a ee 
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the function ¢, € ®, and || ¢, |?’ =1+ do -21|a |? = 1. Since in par- 
ticular ¢, is in £?(— 0,0) we may choose an integer T/ 2 so large that 
(10.4) in | en(t) [Pdt < 7. 
Now set 

fi) = oP, 


We sce that f € 6 and ||f || = 1. Furthermore, by (10.2) and (10.4), 


|en(t) dt + f | ey (t) | de 
[6 Pa = THES, Inere ee 
jel >7/2 | Pr | | Gn | 


so that f € E(er), with er = (+/2n/|| ¢, ||); we observe that er can be 
made arbitrarily small by choosing 7 small, since || ¢, || = 1. By defini- 
tion 


| f(k) P 


[kl >(7/2)+N 








[2 | ¢,(k) |’ a o25 4, | eal) | di 


~ | Pn TE 
whence by (10.1) 


: : 1 
min ae 
{ax} 


2 
= €7 














fee 
Ik} S(7/2)4+N 
Since 7 may be arbitrarily small, Theorem 10 follows. 
11. Theorem 11. For any B < 1, there exists 6 > O and er such that 


. .( k \sin cr(2QWt—k 
[7 iad (4) a(2Wt — k) 


for some f € E(er). 
Proof: We again take W = 3 without loss of generality. We follow a 
line of reasoning used in Ref. 9. 


2 
SA ASE. 





1 
t+n 





Let g(t) = sin at or 
Tn 


Then 








1 ° dt = J of” 2 POU. a 
Tl oe < Dom = Pf fa < , fe 96° 


Now if P, N > 0, with P > N + 1, then 
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P oe) foe) P - 2 
fogs eee a. 
N 


n=1 m=1 YN (t + n)(t + m) nite mite 
— i dt 1 1 
> 52d wae (t+ n)(t + m) nite mite’ 


(cf. Ref. 9, p. 309) 


so 
rs r 


is al af dy TEE d 


since 1/ < + 2)]is monotone iia in x. Now 





] aa, a. x) ~ pF ah aR cea +1) 


ae du 7m i du | 
te} Jo wite(u + 1) 0 uty + 1) 


qs 1 x | ee du 
el tes on apte(a, + 1) 
: sin 5 (1 + 2¢) m u(u + 1) 


fort > N + 3 





amir + 0(6) + O(N) 








Therefore 
[oema>t] Mato +00 +0P) 
ef N+} 
xri/1/4i ax 
=F [2 Ga-&)| G+ 0 + 00 + 00), 


whence 


P 
2 
thd 
i Aaa € ) 
“ = 5 \ ye Be 
I roa oN 
plus smaller terms. If P = N“fora > 1, and ¢ = log a/[(a — 1) log N, 
then 


1 1 —2/(a—1) —2a/ (a—1) 
Ne —_— px = @ —a > 0. 





Hence 
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‘a g(t) dt 
[ g(t) dt 


remains bounded from 0 as N becomes large, by some k (depending on 
a), so that 


” Pl) dt — "(t) d ” g(t) dt 
[aoa Jc a> bf oo ae 


or 


1 


(11.1) fe g(t) dt > lhe om g (t) dt. 


But g(t) = g(t) — Der g(n)[sin x(t — n)]/[r(é — m)] since g(n) = 0, 
n = 1,2,3,---. Therefore, letting 


so) = (e+ [PF], 


we find, using (11.1), 
[rw dt = [0 Se Se 


[ny < [(N*+N)/2] a(t — n) 





2 


dt 








2 
ay f (t) dt. 
bt | > ((Wa—N)/2) 
If we now think of [(N* — N)/2] as T, then [(N* + N)/2] behaves like 
T’ + T, for B = 1/a,and with 6 = (1 — k)’ — 1, Theorem 11 is proved. 
12. We turn next to the case in which f(¢) is neither precisely band- 
limited nor precisely time-limited. 
Theorem 12: If f(t) € & with || f || = 1, and if 
|| Df |) =1- er 
and 


| BFP =1-— 47, 


then for some constants a, = a,(f), 


2Wr 


lewr) 
lf - do AnWn ||” S 12(er + nw)” + nw. 
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Proof: The decomposition 


(12.1) f= BF eG Bf) 

expresses f as the sum of its components in @ and orthogonal to @ re- 

spectively. The Pythagorean theorem then yields 1 = || f ||’? = || Bf ||? + 

| f — Bf |, whence || f — Bf || = aw. Similarly, || f — Df || = er. 
Let g = Bf/+/1 — 72, , so that g € Band ||g || = 1. We will apply 

Theorem 3 to g; to do so, we must estimate its degree of concentration. 

We first expand 


|| Df — DBf 





|’ = (Df — DBf,Df — DBf) 
= || Df | + || DBf |) — 2Re(Df,DBf). 


Moreover, since || Df — Bf || Ss || f — Df || + lf — Bfl| = er + aw, 
we find 


(12.3) (er+ aw)” = || Df — Bf ||’ = || Df |’ + || Br |’ — 2Re(Ds,Bf). 


Since D is a projection, (Df,DBf) = (Df,Bf); hence subtracting (12.3) 
from (12.2), 


|| DBs |’ — || Bf ll? = || Df — DBF |? — (er + aw)’, 





(12.2) 


or 
| Dg |? = | DBF I > | 4 Ir _ ter + nw) 
(12.4) 1 — nw LS nw 1 — nw 
Ss (er + nv) 
os nw 


Consequently, by Theorem 3, there exist constants b, such that 


(2WT] 2 
(12.5) lg — DO bev |? S 12 AL 
0 1 — nw 


Now from (12.1) 
f [2WwT} [2WwT7] f — Bf 
ee ae ala a ede 
W 
and the bracketed terms remain orthogonal. Thus, with 
a = V1 = niv Ok ? 
(2wr 


] 
lf - > apy ||? S 12(er + aw)? + ay. 
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Theorem 12 is established. 

We should point out that by letting g = Df/1/1 — ¢f and working 
with the functions Dy; , the roles of « and 7 may be interchanged, to 
yield the inequality 


[2wT7] 
If — 2s ceDye ||? S 12(er + aw)? + er. 
13. Theorem 13: If f(t) € & with || f || = 1, || Df | = 1 -— 4, 
I| Bf I? =1 — av; 
then for some constants c, = cx(f), 


sin r(2Wt — k) le 


. < 2 
[kl < Wr+l ce n(2Wt — k) S (er + nw) 





Il f — 


+ nw + ler + aw) V1 = ny. 


Proof: We proceed as in Theorem 12, up to (12.4) but now apply The- 
orem 2 instead of Theorem 3. Thus, for some constants Db, , 


sin r(2Wt — k) 2 


lg — [k] <Wrtl \ a(2Wt — k) 


A 


wings er + nw ee (er = nw)” 
ON nw 1 — nw 
Replacing (12.5) by (13.1) and applying without change the rest of 
the proof of Theorem 12 establishes Theorem 13. 
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A Method for Simplifying 


Boolean Functions 


By A. H. SCHEINMAN 
(Manuscript received March 6, 1962) 


This article presents an iterative technique for simplifying Boolean func- 
tions. The method enables the user to obtain prime implicants by simple 
operations on a set of decimal numbers which describe the function. This 
technique may be used for functions of any number of variables. 


At the present time, although several design aids have been intro- 
duced,!:?:3 the synthesis of switching circuits remains a highly developed 
art, the theory being of only limited value to the circuit designer. In 
particular, that part of the design process involving the simplification of 
Boolean functions having large numbers of variables still presents a 
major problem. 

Probably the best method currently available for the solution of such 
problems is the Quine-McCluskey Tabular Method, which consists of 
the exhaustive comparison of terms of the standard sum for adjacencies 
— terms which differ in only one variable. This technique, besides being 
a long, tedious one, determines all possible prime implicants of the given 
function. Thus a second problem is generated — selecting the essential 
prime implicants from among those found. This in itself is often a dif- 
ficult procedure, for which specific methods have been developed.! 

The method to be described in this paper is a simple, iterative tech- 
nique for determining the prime implicants of a Boolean function. All 
of the essential prime implicants are found with this method, and in 
general, some or all of the nonessential prime implicants are auto- 
matically eliminated, materially simplifying the final search for the es- 
sential prime implicants. 


Any Boolean function, say f(vi, --- , 2) for example, may be ex- 
panded into the form 
J his ae, “ey Aa, > 2 a fe Ta, °"*", In) eo a1 -f(0, v2, 7", Ta). 
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The above theorem is generally referred to as the expansion theorem! 
and enables one to expand any n-variable switching function about 
any one of the variables. As shown above the given function is said to 
be expanded about the variable x; . Two new functions are thus formed, 
one multiplying xz; and one multiplying x1’. The two new switching 
functions are functions of we, --- , 2, only, and are referred to as the 
residues of x1. 

Let us assume that the given function is specified in its canonical form 
as a sum of product terms, 


fer, #2, +++ te) = LPs 


where P; represents the general product term and the subscript 7 is 
the decimal equivalent of the associated product term, when the product 
terms are expressed in binary form with primed literals replaced by zeros 
and unprimed literals replaced by ones. This is often written as 


f(a1, €2,°**,%n) = >oj,  P being implied. 


Assume further that the variables are assigned binary weights in the 
order shown, with x; being assigned the highest weight and x, the lowest. 
When the variable weights are assigned in this manner the expansion 
becomes a very simple process. In particular, if a specific series of decimal 
numbers characterizes the function, the residues may be formed as 
follows: 


f(a, 2, os a) => j=1Rit+ ay Re. 


PR» is a sum consisting of the decimal numbers which are smaller than 
the weight of x; (the variable being expanded about), and A; is a sum 
consisting of the numbers which are greater than the weight of 21 , from 
each of which the weight of x is subtracted. 

Noting that the residues 2; and R, are now each specified by summa- 
tions, and the new sets of decimal numbers are interpreted as being 
functions of all succeeding variables, generically we may write: 


R = fe, 2s, py Bade 


Each of the residues can now be expanded about x2, which is the 
highest weighted variable of the residues. Thus the expansion can be 
carried out in this manner using nothing more complicated than the 
subtraction process. 

As an example, assume the following function: 


f(A,B,C,D) = >5(0,1,8,9,10) = AR; + A’/Re 
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where the order of the variables (A,B,C,D) indicates the relative binary 
weights, i.c., A is the highest weighted and D is the lowest weighted 
variable. 


R, = f(B,C,D) = >5 0,1 and 
Ry =a f(B,C,D) == > (8-8), (9-8), (10-8) = » 0,1,2 


since the binary weight of A = 8. 

This process may be repeated by expanding each of the residues about 
B. At this point let us examine R,; and R2. 

There are five possibilities: 


(a) Ry = Re = R, therefore 
f(A,B,C,D) = AR+ A’/R=R 


indicating that A and A’ are redundant 
(b) Ri > Ry, indicating that the decimal numbers representing R,z 
form a subset of those representing R,, then Ry = R, + Ry, and 


{(A,B,C,D) = A(Re+ Ry) + A’Re = AR, + AR, + A’Re = Ro + ARy. 
(c) Re > Ry, where Re = Ri + Ra 
{(A,B,C,D) = AR, + A(Ri + BR.) = 
AR, + A’Ri+ A’Re = Ri + A’Ra. 
(d) The residues have no numbers in common, in which case 
f(A,B,C,D) = AR, + A’Re. 
(e) Some of the numbers in each residue are the same, 
Ri =k. + Ra, k,= Rh + R. and 
S(A,B,C,D) = A(R. + Ra) + A(R. + KR.) 
= AR, + ARy + A’R. + A’'R. 
= Rh, + AR. + A’'R.. 
Note that in this case the function may be written as 
F(A,B,C,D) = Re + A(R. + Ra) + A(R. + PR.) 


The summation R, may be included with A and A’ as a redundancy. 
This is significant in the method to be shown, because #, may con- 
tribute to the simplification of residues resulting from subsequent 
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expansion. This is analogous to using a term of the standard sum in 
more than one subcube when using a Karnaugh map. 

The following method employs all of the above ideas and will be shown 
by means of an example. 

Assume a function 


f(A,B,C,D) = >51,2,3,4,6,7,8,9,11,12,13,14. 


Expansion about the variable A — the highest weighted variable — 
yields 


f(A,B,C,D) = We 1,2,3,4,6,7) + A. 0,1,3,4,5,6) 
= >> 1,3,4,6 + A’ (95 1,2,3,4,6,7) + A(>> 0,1,3,4,5,6). 


Where the summation >> 1,3,4,6 is that part of the A residues which 
are the same, note that the numbers 1,3,4,6 are now redundant in the 
summations multiplying A and A’. The summation 1,3,4,6 actually 
represents those terms of the standard sum which differ only in the A 
variable. 

There are now three summations, each of which is a function of B,C,D 
only. Each of these functions must now be expanded about the variable 
B. These operations are repeated exactly until the expansion about all 
variables is complete. 

At this point the above illustration will be repeated, showing a 
mechanical technique to organize and simplify the procedure. Refer to 
Tig. 1. 

Step 1. Arrange the decimal numbers representing the given function 
in a column. 

Step 2. Divide the decimal numbers into two columnar groups, one 
headed with A’ and one headed with A. The A’ column contains the 
numbers of the original function which are smaller than 8 — the binary 
weight of A —- and the A column contains the numbers which are equal 
to or greater than 8, first subtracting 8 from each. 

Step 3. Include a third column, headed by a dash to indicate the 
redundancy of A and A’, consisting of the numbers which are common to 
columns A and A’. Check the corresponding numbers in columns A and 
A’ to record the fact that they are redundant. If any of the numbers in 
the dashed column have been previously checked in both the A and A’ 
columns, they should also be checked in the dashed column. | 

Step 4. Examine each column. If any column consists of only checked 
numbers, eliminate the column entirely. 

Each of the columns must now be expanded about B by repeating the 
above steps. The expansion of the function in the A’ column is shown in 
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Fig. 1b. Since the weight of B is 4, all numbers less than 4 are placed 
in the B’ column. These numbers are 1,2,3. Note that 1 and 3 must be 
checked since they were previously checked in the A’ column. The num- 
bers 4,6,7 are placed in the B column, first subtracting 4 from each, giving 
0,2,3. The numbers 0,2 which correspond to 4,6 must be checked: A 
dashed column consisting of the numbers 2,3 is now included. Check the 
numbers 2,3 in both the B and B’ columns. All of the numbers in the B 
and B’ columns are now checked, hence both columns may be eliminated 
as shown. 

Figure 2 illustrates the complete development. When the function is 
expanded about the final variable, note that the residues must be 0. 
At this point the prime implicants may be determined by simply tracing 
a path back to the start and reading the appropriate columnar headings. 

Not all of the prime implicants obtained may be required to describe 
the function. In the example just shown, all of the prime implicants were 
essential, but another example will be shown in which this is not the 
case. 


= > 01,2,3,4,6,7,8,9,11,15 


This function is simplified in Fig. 3, and four prime implicants are 
obtained, not all of which are essential. A simple method for determining 


(a) 


ROpuaCeMNaSWN— 


— me te 


WN | 


see 
AG AG (b) 
ae 


Fig. 1 — Example 1: f(A,B VG, DiS ZX 1,2,3,4,6,7,8,9,11,12,13,14; (a) expansion 
sean ‘A, (b) expansion about 
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1 
2 
3 
4 
6 
7 
8 
9 
WW 
12 
13 
14 
A’ A 
1v (e) 
2 tv 
3Vv av 


BOB - OB Os B 
| om nH 
° gl lad 
ees ee D D! 
AC AC' - , 


Fea, c,D,) = X 1,2,3,4,6,7,8,9,11,12,13,14 = AIC +AC'+ B'D + BD! 
Fig. 2 — Example 1 completed. 


the essential prime implicants is available in a prime implicant chart 
first proposed by Quine and later simplified by McCluskey. Such a chart 
is shown in Fig. 4 for the example of Fig. 3 (see Ref. 1). 

The chart requires the establishment of columns, each of which repre- 
sents one of the decimal numbers of the original function and is so 
headed. Each row represents one of the p:ime implicants and is thus 
identified. 

Each prime implicant is a combination of 2* of the decimal numbers 
of the original function; k may be any integer, including zero. It is an 
easy matter to find these numbers. One could, for example, trace back- 
ward from the final residue of the prime implicant (always zero) and if 
a column is passed through which is headed by an unprimed variable 
add the weight of the variable, if through a primed column the number 
remains unchanged. When tracing through a column headed by a dash 
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+ 


A! a 
oY ey 0 
Bos i 
Mt tA 7 
6 

TV 


x 
ee 
We 


ae es, 2, av 5 
rt co 
AGE an aan 

LC ea. | | 
Sen ae 4 
A'p! Bic’ B'D A 


Fig. 3 — Example 2: f(A,B,C,D) =  0,1,2,3,4,6,7,8,9,11,15; example yielding 
nonessential prime implicants. 


each number becomes two numbers, one of which is the same as the 
number in the dashed column. The other number is obtained by adding 
the weight of the variable from whose expansion the dashed column re- 
sulted. 

On each row of the chart, mark a cross under the decimal numbers 


Oi) 6 Be a ee ee eh A 98 
A'D' (0,2,4,6)* 
B'C'(0, 1,8,9)* 
B'D (1, 3,9,11) 
ae ees oe CD (3, 7,11,15)* 
* PRIMARY BASIS ROW = ESSENTIAL TERM 


fia,B,c,d,) = © %1,2,3,4,6,7,8,9, 11,15 = AD'+ BIC’+ CD 


Fig. 4 — Chart method of obtaining essential prime implicants for Example 2. 
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associated with the terms contained in the prime implicant represented 
by that row. Then scan the columns and circle the crosses which stand 
alone in a column, and rule a line through each associated row. Such rows 
represent essential prime implicants. Now rule a vertical line through 
each cross in the ruled rows. If all other crosses in the chart are not thus 
ruled out by the vertical lines, additional prime implicants must be 
chosen. (See Ref. 1 for further discussion of prime implicant charts.) 
The chart of Fig. 4 shows that A’D’, B’C’, and CD are essential and the 
function may therefore be expressed by the following minimum sum 


f(A,B,C,D) = A'D’ + B'C’ + CD. 


If the function of Fig. 3 is simplified by other tabular methods, it will 
be noted that there are six possible prime implicants. This illustrates 
an important advantage of the method described here, and this is that 
while all essential prime implicants are obtained, some or all of the non- 
essential prime implicants may be eliminated automatically, materially 
simplifying the search for the essential terms by: charting. The missing 
prime implicants are actually included in columns which were crossed 
out in the development because all elements were checked, indicating 
that the prime implicants if obtained would be redundant. 

There are functions for which, if all possible prime implicants are 
found, charts would be produced which are cyclic in form. There are 
no immediately apparent choices of prime implicants which would yield 
a minimum sum. In such cases some initial choice must be made, and 
some cut and try is necessary. In fact, there will generally be more than 
one equally satisfactory solution, depending upon the initial choice. 

When the method described in this paper is applied to functions which 
would normally produce a cyclical prime implicant chart, the chart 
obtained will not be cyclical. Some of the prime implicants will be elim- 
inated as the work progresses, in effect making the initial choices auto- 
matically. This has two results. The chart is materially simplified and 
the solution is easily and automatically obtained. However if a particular 
initial choice would result in a more economical solution than another, 
then this method may or may not obtain the minimum sum, depending 
upon the weighting of the variables. The final solution in such cases 
must be regarded as an approximation to the minimum sum. The ap- 
proximation will always be a close one. 

If some of the decimal numbers specifying the given function are 
“don’t care” terms, they must be checked initially and subsequently 
treated like any other checked number. The only real difference is noted 
when drawing a prime implicant chart, where columns corresponding 
to “don’t care” terms would not be included (Refs. 1, 4). 


EN 
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A final example is included in Figure 5 to indicate which of the decimal 
numbers of the given function are combined to obtain any particular 
number in the expansion. The decimal numbers of the given function 
are shown parenthetically. 

In conclusion the method described above offers the following ad- 
vantages: 

(a) The decimal numbers specifying the function may be operated 
on directly without any preliminary grouping. 


x 


AG 2(10)/ 2 (2,10) 

Be 3(11)/ 3 (3,11) 

y 4 (12) 6 (6,14) 

9 5 (13) 7 (7,15) 
TV 


B vee AO aS = 


oz) 10)” 6,14)¥ —-2.(2,6,10,14) 
1 (13) 10,14)” ao Aen 3 (3,7,11,15) 


Be 


xe 


2(14)” 11,15)7 
3(15)¥ aM 
= Cc 
12) 7 Baw 0(12,14) 0(2,6,10,14) 
3)” CU artis)v 1(13, 15) 1(3,7,11,15) 


12,14)/ Lot,15)/ 0 (12,13,14,15) Dy es oe 0 (2,3,6,7, 10, 11,14, 15) 


AB Cc 


2 3 10 1 12 «13 14 15 
C= 2,3,6,7,10, 11,14, 15* 
—S-8 > — AB = F& 12,13, 14,15* 
f(a,B,C,D) = © 2,3,10,11,12,13,14,15 + Lg 1,6,7 
= C+AB 
Fig. 5 — Example 3: f(A,B,C,D) = 2% 1,2,3,6,7,10,11,12,13,14,15; numbers in 


parentheses are decimal numbers which were combined to give numbers in expan- 
sion. r 


1346 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


(b) Adjacencies (terms which differ in only one variable) are found 
in groups, making this method very rapid in use. 

(c) The operations are simple and iterative, enabling functions hav- 
ing any number of variables to be simplified. 

(d) Some nonessential prime implicants will in general be eliminated, 
simplifying the prime implicant chart. 
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Generalized Confocal Resonator Theory 


By G. D. BOYD and H. KOGELNIK 


(Manuscript received March 5, 1962) 


The theory of the confocal resonator is extended to include the effect of 
unequal aperture size and unequal radii of curvature of the two reflectors. 
The latter 1s equivalent to a pertodic sequence of lenses with unequal focal 
lengths. This treatment is in Cartesian coordinates as previously used. In 
an appendix the modes and resonant formulas are written in cylindrical 
coordinates. 

The effect of unequal aperture size of the two reflectors 1s shown to pro- 
duce mode patterns of unequal size on the two reflectors of a confocal reso- 
nator. The previous computations for diffraction losses are found to be 
applicable. Generalization of the theory to the case of reflectors of unequal 
curvature shows the existence of low-loss regions and high-loss regions as 
the reflector spacing is varied. One of the high diffraction loss regions occurs 
when the reflector spacing is between the two unequal radii of curvature. 
Such a region ts interpretable in terms of instabilities in a periodic sequence 
of lenses of unequal focal length. An estimate of diffraction losses is obtained 
for the low-loss regions. The presence of a high diffraction loss region or un- 
stable region should be of importance in the design of resonators or of a per- 
todic sequence of lenses. 


I. INTRODUCTION 


The existence of modes in an open structure such as the confocal 
Fabry-Perot type resonator has been demonstrated by Boyd and Gor- 
don! and by Fox and Li.? This resonator consists of two spherical reflec- 
tors separated by their common radius of curvature, as shown in Fig. 1. 
The reflectors were assumed to be of equal aperture and square! or circu- 
lar? if viewed in the z-direction. Uniform reflectivity over the reflecting 
surface was postulated. Goubau and Schwering*: 4 have also reported on 
this problem and have obtained similar results. 

A mode may be defined as a field distribution that reproduces itself 
in spatial distribution and phase, though not in amplitude, as the wave 
bounces back and forth between the two reflectors. Because of losses 
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Fig. 1 — Confocal resonator with spherical reflectors. 


due to diffraction and reflection, the reproduced pattern is reduced in in- 
tensity on each succeeding traversal of the resonator. The above-men- 
tioned authors have shown that there is a set of modes which will re- 
produce themselves over the equal apertures A; and Ag of the resonator. 

Mathematically the modes of the confocal resonator form a complete 
orthogonal set of functions. For the confocal resonator these modes are 
highly degenerate in frequency; that is, many modes have the same 
resonance frequency. The degeneracy is split when the resonator is made 
nonconfocal by varying the plate spacing, though new degeneracies do 
appear at certain other spacings. Because of this frequency degeneracy, 
the modes of the confocal resonator are not unique unless the effects 
of loss are considered. Any linear combination of the degenerate fre- 
quency modes (the Hermite-Gaussian functions described in Ref. 1) is 
still a mode of the resonator. 

When one includes the effect of diffraction losses due to finite aper- 
tures, the modes become unique, for then the eigenvalue degeneracy is 
split and cach modc has its own characteristic rate of decay or Q. For 
the case of low diffraction losses, the eigenfunctions of the modes are 
still given with good approximation by the Hermite-Gaussian functions, 
which are exact only for the lossless case of infinite apertures. The fre- 
quency degeneracy is unaffected by the inclusion of diffraction losses. 

Boyd and Gordon, in including diffraction losses, considered only the 
case where both reflectors, A1 and Ag, are of equal size. This imposes a 
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certain symmetry on the system. If, however, the two reflectors are of 
different sizes, one might expect for the confocal resonator, with its high 
frequency degeneracy, stationary field configurations that are asym- 
metric in the z-direction. This may be understood by considering the set 
of degenerate Hermite-Gaussian modes which are resonant at a frequency 
given by 2¢ + m + n equal a constant. Combinations of these modes 
may be superimposed at one reflector to form various new field patterns. 
The field patterns on the two reflectors can now be different since the 
original modes with even and odd q change their relative phase by 180° 
in going from one reflector to the other. It is reasonable that the lowest- 
loss mode for an unequal aperture resonator will be such a combination 
that the field patterns will be asymmetrical. This also turns out to be 
true for all higher-order modes. 

For the case of the nonconfocal resonator with spacing such that there 
are no frequency degeneracies, this asymmetry in the stationary field 
configurations is not possible. The field distribution is forced to be sym- 
metrical between the two reflectors. The diffraction losses are then de- 
termined mainly by the smaller of the reflectors. 

Resonators with reflectors of different radii of curvature are investi- 
gated also. A region of high diffraction loss is found for a range of sepa- 
ration of reflectors with unequal curvature near the confocal separation. 
This has some practical significance for resonators and transmission sys- 
tems in that one must be sure to operate in only the low-loss region. Due 
to the possibility of slightly unequal radii of curvature in the fabrica- 
tion of resonators, it is desirable to space the reflectors to obtain a 
nonconfocal condition. The existence of ‘‘stable regions” of low loss and 
“unstable regions” of high loss as the reflector spacing is varied is inter- 
pretable in terms of the stable and unstable regions of a periodic sequence 
of lenses of unequal focal length. 


II. MODES IN A LOSSLESS CONFOCAL RESONATOR 


It was pointed out in the introduction that the modes of the lossless 
confocal resonator are highly degenerate in frequency and thus not 
unique. Boyd and Gordon, in describing the modes of the lossless con- 
focal resonator in terms of Hermite-Gaussian functions, considered only 
the symmetrical situation of identical field patterns and spot sizes over 
each aperture. Because of the high degeneracy of the lossless resonator, 
asymmetric field patterns between the two reflectors are just as pos- 
sible. In this section the relation between asymmetric spot sizes is ob- 
tained. Only the introduction in the following section of unequal aper- 
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tures and the resulting diffraction losses will allow one to state which 
combination of asymmetric spot sizes is a unique mode of the system. 

Boyd and Gordon have computed surfaces of constant phase within 
and without the confocal resonator. These surfaces have approximately 
a spherical shape. Any of these surfaces may be replaced by spherical 
reflectors to form a new resonating structure of arbitrary spacing and 
curvature. Except for the obvious special case, such a resonator was 
termed nonconfocal. Boyd and Gordon have shown that each confocal 
system of radius of curvature and separation equal to b generates a set 
of surfaces of constant phase of radius b’ and separation d linked by the 
relation* 


d’ — 2db' + 0 = 0. (1) 


For a given b and 0b’ there are two possible reflector separations, d; and 
ds : 


d= + Ve? —B, 
and (2) 
d, = b' — ~/b? — Bb. 


The field distribution of the modes of these nonconfocal systems is 
symmetric with respect to the system center (as are the fields of the 
generating confocal system). The fundamental modes of all these 
nonconfocal systems have a spot size of radius w, at the center of the 
resonator, given by 

ON 


Wo = =?) 
2a 


(3) 


where \ is the wavelength. The spot size of the fundamental mode of 
the confocal system at the reflectors is w, = wor/2. In general, the 
spot size at a distance d/2 from the center is given by 


9 
ww! = a 4/1 + & (4) 


The surface of constant phase at the center (2 = 0) is a plane, and 
the whole family of surfaces is symmetric with respect to it. So far we 


* The notation used here is consistent with that of Boyd and Gordon but unfor- 
tunately not with that of Fox and Li, who use b for the spacing of the plane paral- 
lel resonator. We use d for the spacing of the reflectors, b for the confocal radius 
of curvature and thus its spacing, and b’ for the radius of curvature of a surface 
of constant phase and for the radius of curvature of the reflectors of a nonconfocal 
resonator with identically curved reflectors. For the nonconfocal resonator with 
unequal radii of curvature we use bi and be. 
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have arranged the reflector pair symmetrically to this plane, and sym- 
metric mode configurations have resulted. It is possible, however, to 
construct a field configuration which is asymmetric with respect to the 
reflector system by placing one of the reflectors (with radius of curvature 
b’) at 2 = d,/2 and the other at z = —d./2. At these locations both 
surfaces of constant phase have the same radius of curvature b’. This 
is indicated in Fig. 2(a). The resulting reflector separation can be 
computed from (2) as | 


d = 3(di + dz) = O’. (5) 


Since the spacing d equals the reflector radius of curvature 0’, it is 
apparent that a new confocal system has been formed. Apart from the 
surfaces of constant phase, consideration of spot sizes assures us that the 











(a) 





~b/2 


(b) 


Fig. 2 — (a) Surfaces of constant phase including asymmetric confocal sys- 
; —— 


tems; (b) relative spot size — = 1+ —. 
oO 
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modes obtained for this confocal system are, indeed, asymmetric. As 
indicated in Fig. 2(b) the spot size reaches its minimum value of w, 
at z = 0 and is not by any means in the center of the reflector system. 
The spot sizes at the new reflectors can be computed from (4) as 


dy? 
m 4/14, 


UW= 
4 ade a 
W2 = Wo / 1+ ok 
This combined with (2) yields the relation 
Wil, = —, (7) 
T 


where +/)b’/z is the spot size at the reflectors, which we would expect 
for the symmetrical set of modes of a confocal system of spacing 0b’. 

The resonance condition for this system is obtained via Boyd and 
Gordon’s equation (20) after some computation as 

4p’ 
y= qt A+ m+n), (8) 
where m, n, and g are the mode numbers as defined in Boyd and Gordon’s 
work. By comparing their equation (14) with our result, we find that 
the resonance conditions for the symmetric and the asymmetric modes of 
the confocal system are identical, as expected. 

By suitably choosing b for a given reflector curvature b’, almost any 
ratio of reflector spot sizes w,/we can be obtained. Thus, for a given 
lossless confocal system, the confocal geometry allows an infinite number 
of sets of modes (characterized by the spot sizes at each aperture). It is 
the finite size and shape of the reflector that selects one particular set, 
as we shall see in the following section. 


Ill. MODES OF A CONFOCAL RESONATOR WITH REFLECTOR SIZES UNEQUAL 


Consider a confocal resonator. Assume that the reflectors A; and A» 
are, in general, of different sizes and/or shapes. With this asymmetry 
in mind, it is no longer reasonable to postulate that the field pattern 
on A, be reproduced on Az when looking for self-consistent field con- 
figurations. Instead, as a more generalized definition of a mode let us 
require that an energy distribution launched with a certain pattern on 
A, reproduce this pattern on A; after bouncing back from A, . No con- 
dition on the pattern on A, is imposed. 


GENERALIZED CONFOCAL RESONATOR THEORY 1353 


To express this mathematically we use the approximations of Boyd 
and Gordon’s paper and the scalar formulation of Huygens’ principle. 
A wave leaving reflector A, with a field pattern L(x, y) arrives at A» 
with a pattern E’(2’, y’) given by 


Bay) = ef aca BG, De™, (9) 
2xb Aj 
where 0 is the mirror separation, k = 2a/\, and 
1, . 
p= b —= (Ga + Gy’). (10) 


Most of the energy is reflected from A, and travels back to A,. The 
radii of curvature of the reflectors are assumed very large compared to 
the wavelength \, and we can therefore assume that laws for the reflection 
of plane waves apply locally. Then we find that the reflected wave 
leaves Az with the pattern —H’(2’, y’). It will arrive at A; witha certain 
distribution pattern which we shall call —cp’ ¢,” E(x, y). At this point 
we have introduced the postulate that the field patterns be reproduced, 
except for the amplitude factor o, o,°, after one complete return trip. 
Again, the energy is bounced back and leaves reflector A; with a field 
distribution which can be expressed in terms of E’(x’, y’) as 


Om on (x,y) = ae dx’ dy’ E'(a' ye", (11) 
2rb Ag 
with 
/ 1 7 / ‘ 
ee ae + yy). (12) 


Substituting (9) into (11) to eliminate EH’, then inserting the expressions 
(10) and (12) for p and »’, and, finally, interchanging integrals, one 
obtains an integral equation 

ooo 


2 2 





[ dag Be, DK@,%1,0), (13) 
Al 
with the kernel 
= = / / .k 7 - 7 = 
K(a,a;u,9) = [ da’ ay exp (i Fl (x + 4) +¥u+D)), (14) 


This is the fundamental integral equation that yields as its solution 
the modes of our system and their diffraction losses. The kernel K(x, 2; 


1354 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


y, 9) depends on the shape and size of reflector A» and will be evaluated 
for some special cases in the following sections. 

Integral equations for the modes of asymmetric nonconfocal esto 
can be derived on the basis of arguments similar to the ones used in 
this chapter, but solutions for them are in general not available. 


IV. CONFOCAL RESONATORS WITH UNEQUAL SQUARE AND RECTANGULAR 
APERTURES 


In this section a confocal resonator with two reflectors of finite but 
unequal size is considered. Let reflector A; extend from —a, to +a, in 
the x direction and from —<A,; to +A, in the y direction as shown in 
Fig. 3(a). Reflector A» is chosen to be of a rectangular shape i by 
2A. correspondingly. 

For the above reflector dimensions, the kernel of integral sannecn 
(13) takes the form 


K(x,8;u9) =| ada’ [ __ dy exp (he@ra + +a)), (15) 





REFLECTOR Ay (a) 





(b) 


Fig. 3 — (a) Reflectors with rectangular aperture; (b) reflector Az blocked in 
center. 
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These integrals can be evaluated oo and the kernel rewritten as 


2 sin = * axle + £) sin = © Aaly + 9) 


4b (16) 
K(2,2#;y,9) = ae Se 
ee (x + @) (y+ 9) 
Assume that the field pattern of a mode can be written in the form 
E(x, y) = Lofm(x)gn(y); (17) 


with E, a constant amplitude factor, f(x) a function of x only, and 
g(y) a function of y only. Under these conditions integral equation (13) 
can be rearranged as 


sin ; ao(a + 2) 


|a(a + Z) a3) 


2ikb ba 
Cm On "fn(X)Gn(y) = os [ dE fm (#) 
ee = 
Al sin = As(y + 9) 
fo 
AL my + 9) 


An integral relation satisfied by the angular prolate spheroidal wave 
functions Son(c, s) is® 


i sin c(t — s) 
2 aa (6, IF Sn(e,) = [ds BEE— 9) s,5), (19) 


where on 1) is the radial prolate spheroidal wave function of 
nth order. Because 


Son(c, 8) = (—-1)" Sole, —s), (20) 
it holds that 


sin c(t + s) 
a(t + s) 


We can compare this relation with integral equation (18) (with ¢ = 
(k/b)aiaz and C = (k/b)AiA2) and conclude that the field patterns of 
the TEM.,,.,q mode of the system are 


on Ai: E(a,y) ~ Som (« =) Son (<, 3) : 


r , 
on Ae: E'(2’,y') < Som (« =) Son @ r) . 
As pointed out in Boyd and Gordon’s work, the prolate spheroidal 


2 Ra (6, DF SC.) = (0 [as Soules). (21) 


(22) 
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wave functions Son can be approximated by orthogonal Hermite func- 
tions if the apertures are large enough. This enables one to derive an 
approximate formula for the dimensions of the ‘‘spot’’ of the funda- 
mental, which indicates the location where the field of this mode has 
decreased by a factor e * with respect to its maximum. For rectangular 
reflectors these quantities will, in general, be different for the x and the 
y directions. One obtains 


a —_ = 
on reflector A; : aay e 4/2: Y, = es 


= _ (23) 
on reflector A2: Ly = a bh ; "Ys = As ba ; 
ay va Ay vie 
Note that 
BeBe = YsYo = Ws. = = (24) 


which agrees with (7). rom the above we see that, compared to a 
confocal resonator with equal apertures, the patterns of all modes on 
reflector A, are now magnified by a factor ~/a;/a, in x direction and 
a factor ~/A;/A,_ in y direction, if a1 > a2 and A; > A,. The patterns 
on reflector A: are compressed correspondingly. 

The center of the reflector system is no longer the position of maxi- 
mum energy density. The position of maximum energy density will, 
in general, be different for the concentration in the x direction as com- 
pared to the concentration in the y direction. One computes displace- 
ments D, and D, of the positions of maximum concentration from the 
center in the direction of the smaller reflector: 


_batmat  , bata at 
2 az + a?’ = 2 A? + A?” 





D, (25) 


Comparison of (18) and (21) also yields the eigenvalues of the integral 
equation (20): 


om. on a ( re Lee a (Rime (c, 1) 7 Ron” (C, Il (26) 


This shows that 
(7) the resonance condition 


4b 


y= 2+ A +m +n) (27) 
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is not changed by making the apertures of the reflectors of a confocal 
system unequal, and 
(it) the diffraction losses of a confocal system with unequal reflector 
apertures of dimensions a; , Ai, a2, and A2 are equal to the diffraction 
losses of a confocal system with equal aperture dimensions a,, A. if 
2 2 
Mo = aa, and Ag = AjAg. 


V. CONFOCAL RESONATOR WITH ONE REFLECTOR PARTIALLY BLOCKED 


In this section we would like to quickly sketch the analytical treatment 
of a confocal reflector system in which one reflector —in our case 
Az— is blacked out in the center as shown in Fig. 3(b). The other 
reflector dimensions are assumed to be the same as in the previous 
section. The effective shape of reflector Az is now that of two rectangles 
of width Aa, extending from x’ = —a:; — (Aa/2) to —a, + (Aa/2), 
and from 2’ = a; — (Aa/2) to a, + (Aa/2). If we insert the corre- 
sponding limits into (14), we obtain the kernel 


8b" . [ kAa : 
Kanu = pep yaeH Le Ot) 


seos| Mas (a + 2) | sin k Aly + » | 


for the integral equation describing the system. 
For very small reflector width Aa « bd/a, the kernel is given with 
good approximation by 


(28) 


4bAa os 
ky+n) 


With this kernel, integral equation (13) can be separated into one 
equation containing functions of « and one containing functions of y 
only, as in the previous section. While the latter is the same as the 
integral equation treated in Section IV, the equation for f(a) is of the 
form 


K (2,4; 4,9) = (2 + &) sin A? (y+). (29) 


7 


yf(a) = 2Aa ee dz f(#) cos — (a + @). (30) 


Applying standard procedures, this integral equation can be solved 
elementarily. The solutions are 


hey = cos A a, (31) 
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with the eigenvalue 





O..6 cK 
y = 2Aa (« + Thaz sin 2 ; cus) ; (32) 
and | 
f(x) = sin Ho, (33) 
with the corresponding eigenvalue 
y = —2Aa (a, = as sin ae ata) : (34) 
2kaz b 


The eigenvalues, of course, determine the diffraction losses and the 
resonance conditions for the reproducing patterns, as in the previous 
section. The resonance formula is 


= =2¢+1i+n (35) 
for the even cosine-function, and 
MO = Og +240 (36) 


for the odd sine-function. 

One should note that the field distribution on reflector A, is simply 
the two-slit diffraction pattern one would expect from coherent excita- 
tion of the two narrow reflectors comprising A: . 


VI. RESONATORS WITH REFLECTORS OF UNEQUAL CURVATURE 


To investigate resonator systems with concave reflectors of unequal 
radii of curvature, let us return to the consideration of a lossless system. 
From this model one can obtain information on spot sizes and resonance 
conditions. Diffraction losses will be estimated using the same approxi- 
mation previously used by Boyd and Gordon for the nonconfocal resona- 
tor of equal curvature. 


6.1 Surfaces of Constant Phase 


Let reflector A; have a radius of curvature b,, and reflector A» a 
radius of b. . We shall base our argument on Boyd and Gordon’s picture 
of surfaces of constant phase (Fig. 2), which we have already used in 
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Section II. In a set of surfaces, characterized by the confocal parameter 
b, reflector A; can be placed at the distances +d,/2 from the center, 
and Az at +d,/2 correspondingly. These distances can be computed from 
(2) as 


dy = bi = vV b? = b?, (37) 
ds —_ be & V be? =. b?. 


With given concave reflectors, therefore, four different resonator systems 
can be found which fit this particular set of surfaces of constant phase. 
The four different reflector separations d = 3(di + d:) are given by 


W=b+tb + Vor — B+ Vb? — B2 (38) 


To obtain various other resonator systems the parameter b can be 
varied. But the range of this variation is restricted, since only real 
valued distances have physical meaning in this context. If we assume 
that b. > b, , it follows from (38) that b can be varied in the range from 
0 to b;. One can thus obtain reflector separations d in the range from 
0 to b; and from be to b; + be as shown in Fig. 4. No information on 
resonators with reflector separations in the range from 0; to b: can be 
obtained. It is of interest that the confocal system for reflectors of 
unequal curvature, with d = 3(b, + be), is just in this “unstable region.” 

Let us restrict our discussion to systems of given b; , be , and d in the 
range covered by the picture of surfaces of constant phase. We can 
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Fig. 4 — Spot sizes and high- and low-loss regions for a resonator with reflec- 
tors of unequal curvature and variable spacing. 
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inquire into the spot sizes w, on A; and w:..on Az. Combining (1), (4), 
(37), and (38) we obtain the relations 


W ‘ bi be —d 
(2) be bp =a 3?) 
2 (r\'__ bibed 
(nen)? = (2) pee a 


It also follows that the maximum concentration of energy occurs 
at the distance 


dy _ be — ad 

Sey oy) 
from reflector A,, and at 

en a ee (42) 


20 bb +b, — 2d 
from A2. 

In Fig. 4 we have shown for a special case how the spot sizes w; and 
W2 vary as a function of the reflector spacing d. In this figure the spot 
size on each reflector is normalized in terms of wa = ~/b:\/7 and w,. = 
4/br/r. These are the spot sizes at the reflectors of equal-radii con- 
focal resonators with radii of b; and be respectively. 

Note that as d approaches b, , the spot size w. on Az approaches zero, 
while the spot size w; on A; increases beyond limit. The corresponding 
effect occurs if d approaches b. from above. It should be remembered 
that the information obtained here can be applied usefully only so long 
as the spot sizes are somewhat smaller than the corresponding reflector 
dimensions. 

The diffraction losses of the nonconfocal resonator of equal radii of 
curvature and aperture were previously estimated by Boyd and Gordon 
on the assumption that the diffraction loss is equal to that of its equiva- 
lent confocal resonator with reflector dimensions scaled up by the ratio 
of their spot sizes. 

Tor the nonconfocal resonator of unequal radii of curvature and 
square apertures of sides 2a; and 2a, respectively, the equivalent Fresnel 
numbers at reflectors A; and Az, which determine the diffraction losses 
at each reflector, are obtained from Boyd and Gordon’s equation (29) as 


@).-abe-G)] 
bv 1 did bi bi ; 
EE- OT 
bd sea b.  \boJ J}? 





(43) 
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where d; and d2 are determined by (41) and (42). The diffraction loss 
at each reflector is then obtainable from Fig. 3 of Boyd and Gordon as 
Qpi and Ap2. 

The resonator Q is given by 


2rd 
where 
a= 3 (api + apo) e QR (45) 


and ap represents the reflection loss per bounce at a reflector plus the 
single-pass scattering and absorption loss between the reflectors. 

On the basis of this estimate, one concludes that the diffraction losses 
increase sharply if the separation d approaches an “unstable” region. 
No similar estimate of diffraction losses is available for the “unstable”’ 
regions. However, a ray optical analysis which we present in the next 
section shows the divergent nature of “unstable” resonator systems. 
This indicates relatively high diffraction losses. 

With results obtained here and the help of Boyd and Gordon’s equa- 
tion (20), the resonance condition for the resonator with reflectors of 
different curvature can be computed as 


fag thatmen os g/(i-N-4), (46) 


To compare this with Boyd and Gordon’s resonance formula for reso- 
nators with equal curvature b’, we rewrite their equation (31) in terms 
of d and 0’: 


tg + htm +n) cos (1 - 8), (47) 
r 1g b’ 
We have found this to be a very convenient form in which to rewrite 
their resonance formula (31). But due to well known relations between 
the trigonometric functions, various other formulations are possible. 
One of these formulations was given by J. R. Pierce.® 

A half nonconfocal resonator may be formed by a plane reflector and 
a spherical reflector of radius of curvature b; and spacing d between the 
reflectors with d < bi < o. The resonant condition may be obtained 
from (46) by letting b. — «. The result is given by 


Fg tht mtn) cost (1-2). (48) 
T by 
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6.2 Equivalent Sequence of Lenses 


Let us call the regions bh + be > d > bhandO <d < “stable” 
or “low loss,’ and the regions d > b; + be and bh < d < be “unstable” 
or “high loss.”” We can understand these stable and unstable regions 
of a resonator system with reflectors of unequal curvature from another 
point of view if we replace the resonator by an equivalent sequence of 
lenses. These lenses are spaced at distances d and have focal lengths of 
fi = 6/2 and fo = b./2 respectively. Lens systems of this type have 
been used in periodic focusing of long electron beams and instabilities 
have been observed. 

Stability investigations of sequences of lenses of equal focal length are 
readily available.” These systems are stable if 


0<7<4, (49) 


where ZL is the lens spacing and f the focal length. 
A pair of lenses of focal lengths f, and f2 spaced at the distance d-can 
be replaced by an equivalent optical system’ of a focal length f given by 
(en Kr d 
SS eee a ee 
f fi fo f: fe 
The system’s principal planes are found to be spaced at distances hy = 
d( f/f.) and hz = d(f/fi) from the corresponding lenses (see Fig. 5). 
If we substitute such a thick lens for each pair of unequal lenses of 
our system, we obtain a sequence of equal optical systems of focal 
length f. If, furthermore, we define as their “effective” spacing 


ie ee eon Pe 51) 


the arguments of Pierce’s treatment’ are applicable to our case. 


ria 


ae Se P 


(50) 


Fig. 5 — Sequence of lenses of alternating focal length 
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Combining equations we obtain 


L 2c 2 d 

—~=d(-+>----). 52) 

j fF a a 
From (49) and (52) one can show that the boundaries of the stable re- 


gions are given by 
d d 
we ae oo <= 
(¢ 1) (4 1) = 1, (53) 


Go (S25) a0. (54) 
(F )( ) 


These relations define the stable and unstable regions in agreement with 
the preceding discussion, but they are also valid for negative values of 
b, and b.. If one allows for convex reflectors, one can also obtain this 
somewhat generalized result from the picture of surfaces of constant 
phase. 


and 


6.3 Stability Diagram 


A. G. Fox and T. Li have suggested a two-dimensional diagram of 
the stable and unstable regions which is very instructive. Several choices 
of coordinates are possible. In Fig. 6 we have plotted d/b; and d/b: as 
coordinates. In this diagram the boundary lines described by (54) ap- 
pear as straight lines, and the curve represented by (53) as a hyper- 
bola, as shown. For confocal systems, i.e., systems with coinciding 
reflector foci, we have 2d = 6, + b., which may be written: 


d 1 d 1 1 
(¢ - 5) (2-5) =7.- <8) 


In our diagram, therefore, these systems are represented by points on 
another hyperbola and fall within the high-loss region. A transition from 
a “stable” to an ‘“‘unstable”’ region means an extremely sharp increase 
of diffraction losses for reasonably large Fresnel numbers. 

’ The confocal system with reflectors of equal curvature is represented 
by a rather singular point in our diagram. We see that certain deviations 
from the ideal dimensions d = b; = be will greatly increase the system 
losses. This should be taken into account when designing maser reso- 
nators or optical transmission systems, and it’ may be advisable to 
choose points of operation at a safe distance from the unstable region. 
The degenerate frequency characteristics of the confocal system can be 
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Fig. 6 — Two-dimensional diagram of stable and unstable regions. 


obtained, if desired, by using a concave reflector (b; = 2d) and a flat 
one (be co) spaced at the distance d = b,/2. This system is in a sta- 
ble region. 


6.4 Lquivalent Systems 


One may ask whether operation in one of the two low-loss regions is 
to be preferred to the other. We have found that the two regions are 
absolutely equivalent as far as diffraction losses, spot sizes, and resonance 
conditions are concerned. For a given reflector spacing d, we can find 
a corresponding pair of resonator systems, one operating in the lower 
stable region with reflector radii bi, and be, < d, and one in the upper 
region with reflector radii biy and vy > d. Spot sizes, diffraction losses, 


GENERALIZED CONFOCAL RESONATOR THEORY 1365 


and frequency response of both systems are the same if the conditions 


iT,i_?2 
bir = biv od 
and (56) 
1,4 -% 
ee ae 


hold. In this case the mode patterns of one resonator correspond to the 
complex conjugates of the mode patterns of the other. 

Some of these conclusions can be drawn on the basis of (39), (40), and 
(46) alone. To prove the correspondence of diffraction losses and mode 
patterns, however, the integral equations of the resonator systems have 
to be investigated using a procedure suggested by T. Li for the ana- 
lysis of nonconfocal resonators with equal reflectors. Examination of 
the integral equations shows that unstable systems also are equiva- 
lent under the conditions given above. 


VII. SUMMARY 


The effects of unequal reflector apertures on the modes of a confocal 
resonator have been discussed. It was found that unequal apertures at 
the two reflectors have a large effect in determining the mode patterns. 
The resonant condition, however, is not changed by an asymmetry of 
this kind. Boyd and Gordon’s picture of surfaces of constant phase does 
not contain nonconfocal systems (of equal curvature) with similar asym- 
metries in spot size. This is because lossless nonconfocal resonators are 
not, except for special cases, degenerate in frequency. This shows that 
the mode patterns of nonconfocal systems are not significantly changed 
if the reflectors are of unequal size but are larger than the spot size at 
the reflectors. 

Tor resonators formed of two reflectors of unequal curvature, unstable 
regions of high loss are shown to exist in that an equivalent sequence of 
lenses becomes defocusing. The true confocal resonator is on the border 
of such unstable regions, though in fact it has minimum diffraction 
losses. Unfortunate deviations from the dimensions of the ideal confocal 
resonator can produce a system of high loss. The implications for the 
design of resonators for gaseous or solid optical masers or of long distance 
optical transmission systems are that the “equal’’ radii of curvature 
should be made slightly larger (or smaller) than the reflector spacing. 
One can also choose to simulate a confocal resonator with one curved 


1366 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


and one flat reflector spaced at half the radius of curvature. This system 
is stable. 


VIII. ACKNOWLEDGMENTS 


Stimulating discussions with and constructive criticism from J. P. 
Gordon, W. W. Rigrod, A. G. Fox and T. Li are sincerely appreciated. 


APPENDIX 


Cylindrical Coordinates 


In this paper and in the paper of Boyd and Gordon! the mathematical 
analyses were based on a system of Cartesian coordinates. Resonators 
with reflectors of square or rectangular aperture have been investigated. 
Tor large apertures the authors were able to obtain approximations for 
the properties of the resonator modes on which many arguments of this 
paper are based. They showed that the mode patterns are describable 
in terms of Hermite-Gaussian functions. 

For certain classes of problems, for instance if it is desired to obtain 
diffraction losses for circular apertures, it is preferable to use a cylindrical 
system of coordinates. Approximate solutions for the modes of resona- 
tors with reflectors of large circular apertures can be obtained from the 
work of Goubau and. Schwering.® The results of these authors are pre- 
sented in terms of hybrid waves, but by suitably combining two hybrid 
waves one obtains modes which for our purposes can be regarded as 
linearly polarized TEM waves. Goubau and Schwering show that the 
mode patterns are describable in terms of associated Laguerre-Gaussian 
functions. Fox and Li® have given asymptotic solutions for the mode pat- 
terns in terms of Sonine’s polynomials, which can be shown to be equiva- 
lent to the above results. 

One can obtain asymptotic solutions for the modes of the confocal 
resonator in cylindrical coordinates by making a scalar wave approxima- 
tion and using Huygens’ principle. This leads to an integral equation 
for the modes of the confocal resonator with reflector spacing and curva- 
ture b in cylindrical coordinates, which has been given in Appendix C 
of Fox and Li. For an infinitely large aperture their result may be 
written in the form 


x{-ViRglt)} = exp | ib + iS (i + )| 
2 (57) 
[ VJ UR p(t’) J (tt) St dt’, 
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where ¢t = rV/(k/b) and r is the radial distance in the plane of the 
aperture. As the solution of the above equation the function Rp:(¢) 
describes the radial dependence of the modes and the angular depend- 
ence is e*"’’, The resonance condition for the individual modes is ob- 
tained from the eigenvalue x of (57). . 

From Magnus and Oberhettinger! one observes that the associated 
Laguerre-Gaussian function is self-reciprocal under the Hankel trans- 
formation. Thus the solution of (57) is given by 


Rylt) « UL,'(P) 6?" (58) 


where Ly (t) is the associated Laguerre polynomial. The associated 
eigenvalue is found to be 


xX = exp | it +i 5 (Qp+1+ | (59) 


which leads to the resonance condition for the confocal resonator with 
q as the longitudinal mode number: 


P= tp tlh. (60) 


The field distribution of the modes inside and outside the confocal 
resonator can be derived from the mode patterns on the reflectors by 
using Huygens’ principle, as in Boyd and Gordon’s paper. The field 
distribution can of course be obtained from Goubau and Schwering’s 
work. Comparing Goubau and Schwering’s equation (5a) with Boyd 
and Gordon’s equation (20), one finds that the surfaces of constant 
phase of a Cartesian TEM,,.,., mode are identical with the surfaces of 
constant phase of a cylindrical TEM,., mode if 


mt+tn= 2p+1. (61) 


The fields and therefore the spot size of the fundamental THMoo, 
Cartesian mode and the fields of the fundamental cylindrical mode 
are identical throughout the resonator. 

The resonance conditions and spot sizes of the modes of resonators 
with large circular apertures can therefore be deduced for nonconfocal 
resonators and resonators with reflectors of unequal radii of curvature, 
in exactly the same fashion as has been done for square apertures. 

We do not propose to present this derivation again, but list as a 
reference some characteristic properties of the modes of resonators with 
large circular apertures, together with properties of the modes of reso- 
nators with large square apertures. It may be worth repeating that an 
aperture is considered “large” for a particular mode if the mode’s 
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energy, as calculated from the approximations below, is well concen- 
trated within the aperture. Only under this condition are the mode’s 
characteristics reasonably well described by the formulae listed below. 


A.1 Approximations for Resonators with Large Circular Apertures. 


A system of cylindrical coordinates (7, g, 2) is used, where the z-axis 
coincides with the resonator’s optical axis. The corresponding modes are 
designated TEM 1, . 


A.1.1 Nonconfocal Resonators with Reflectors of Equal Radius of 
Curvature b’ and a Reflector Spacing d. 


At the reflectors the spot size w,’ of the fundamental THMo, mode 


is given by 
yk Ge ee 
pte "A = (a3) : So 


The relative field distribution (mode pattern) at the reflectors (z = 
+d/2) of a TEM,,, mode is given by 


l 2 
Bove sal) (1) nyt (2 Zeemote (68 


where L,’ are the associated Laguerre polynomials. The mode resonates 
at a wavelength given by 
2d 


4g +h ep tities (1-4), (64) 
r ™ b 


A.1.2 Resonators with Reflectors of Unequal Radii of Curvature b, 
and b, and a Reflector Spacing d. 


The spots of the fundamental mode are in general of different size on 
the two reflectors. We have a spot size w; on the reflector with radius of 
curvature b; and vice versa. In (389) and (40) these quantities have 
been expressed in terms of X, d, b; , and b2. The TEM,:, mode patterns 
on the reflectors are obtained from equation (63) by substituting for 
w,' the corresponding w; or W» . 


The resonance condition is 


_ + =p + 1+ 1) cos” G-5)(1-§)- (65) 
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A.2 Approximations for Resonators with Large Square Apertures. 


A system of Cartesian coordinates (x, y, 2) is used with the z-axis 
coinciding with the resonator axis. The corresponding modes are desig- 
nated TEM mg . 


A.2.1 Nonconfocal Resonators of Equal Radius of Curvature b’ and a 
Reflector Spacing d. 


The spot size w,’ of the fundamental mode at the reflectors is again 
given by (62). The mode pattern of a TEM,,,, mode at the reflectors 
is given by 


Maat — 5, (8). n (WA) ene 
E, Ws Ws 


where the H,, are the Hermitian polynomials. The resonance condition 
for this mode is given by (47). 


A.2.2 Resonators with Reflectors of Unequal Radii of Curvature by and 
bo. and a Reflector Spacing d. 


The spot sizes w; and we of the fundamental mode at the two reflec- 
tors are the same as those discussed in Section A.1.2. The mode patterns 
of the TEM ng mode at the corresponding reflectors are obtained by 
substituting w, or we for w,’ in (66). The resonance condition is given 
by (46). 
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A Unidirectional Traveling-Wave 
Optical Maser* 


By J. E. GEUSIC and H. E. D. SCOVIL 
(Manuscript received March 20, 1962) 


The basic ideas leading to a unidirectional traveling-wave optical maser 
are presented. Experimental data on the performance of pulsed ruby am- 
plifying sections and high density PbO glass Faraday rotation isolators 
are given. Feasibility tests on a two-section device have been made and are 
in agreement with predictions. Some remarks are made concerning image 
definition, channel capacity, noise and pump power requirements. 


I. INTRODUCTION 


Net gain at optical frequencies was demonstrated by the successful 
operation of maser oscillators.':? The first direct measurements of gain 
at optical frequencies were made by Javan, Bennett and Balik’ for the 
helium-neon gas maser and by Kisliuk and Boyle‘ in a ruby solid state 
maser. The optical amplifiers described by these investigators were low- 
gain devices. Since at optical frequencies the maser is the only available 
amplifier which at present preserves amplitude and phase information, 
it may well have to provide extremely high stable gains of between 30-60 
db in many applications, as for example in an optical’ communications 
system. 

It is well known that to realize a stable high-gain amplifier it is essen- 
tial to make the amplifier nonreciprocal. This has been accomplished 
in the case of the microwave traveling-wave maser® but has not pre- 
viously been reported at optical frequencies. The objective of this paper 
is to discuss the basic principles which are necessary for realizing a non- 
reciprocal optical amplifier and to report the successful operation of a 
pulsed unidirectional traveling-wave optical maser (TWOM) using 
ruby. 

Also the image amplifying ability of the TWOM is discussed and 
demonstrated. 


* This work was supported in part by the U. 8. Army Signal Corps under Con- 
tract DA-36-039-sc-87340. 
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II. BASIC PRINCIPLES 


Consider a multimode transmission line containing active maser 
material as shown in Tig. 1. Over the length shown, assume that the 
single-pass power gain is Gp, that it is reciprocal, and that. at each end 
the power reflection coefficient is r. If Gor < 1, the amplifier is stable. 
For a ruby optical maser with an air interface, r ~ 0.07; hence such a 
device will exhibit a stable gain if G < 14. If an r } 0.07 is accepted as 
typical (it may be made much smaller by the use of antireflection coat- 
ings), then it is evident that a simple optical maser with a gain of ~6 
db is a very stable amplifier with very little regeneration. On the other 
hand if a gain of 30 db is required, then r must be <0.001. This is ex- 
tremely difficult to achieve, and it is evident that a stable gain of 60 
db is essentially impossible from such a device. 

The optical traveling-wave maser (TWOM) shown in Fig. 2 is a de- 
vice capable of achieving extremely high stable gains. The device con- 
sists of a succession of amplifying sections, each of which has a gain of 
6 db for the typical figures given above. These sections are separated 
by nonreciprocal elements or isolators so that power is easily transmitted 
in the direction of the arrows but strongly attenuated in the reverse 
direction. In fact the reverse loss in db of the isolator is chosen to exceed 
twice the single-pass gain in db of an amplifying section. | 

A simple way to realize isolation is to use the optical equivalent of a. 
microwave Iaraday rotation isolator. The main differences are the fre- 
quency, the material used, and the fact that our transmission line can 
support many modes. One may use numerous materials for the non- 
reciprocal rotator. Although transparent ferromagnetics, ferrimagnetics 
or anti-ferromagnetics may be considered, many other classes of mate- 
rials could also be used; for instance it is not even necessary to use a 
material with a permanent magnetic dipole moment. Diamagnetics may 
be employed since they also exhibit I'araday rotation. In fact, non- 
reciprocal rotation was first observed by Faraday in a diamagnetic glass. 

In the microwave Faraday rotation isolator the plane of polarization 
is determined by the rectangular waveguide. In the optical counterpart 
a polarizing medium can be used to fulfill the same function. In addition 
to having nonreciprocal rotation and defining the plane of polarization, 


Fig. 1 — Transmission line containing an active maser material. 
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Fig. 2 — Schematic diagram of a unidirectional traveling-wave optical maser. 


we need to be able to absorb waves with the unwanted polarization. 
In the microwave equivalent this is done by using a sheet of loss film 
placed so that its plane is parallel to the electric field of the wave which 
we wish to absorb. In the optical version this absorption may be com- 
bined in the polarizing medium itself if we use a material which has 
dichroic characteristics. The construction of an isolator is now straight- 
forward and is shown in I’ig. 3. In Fig. 3 a wave enters from the left-hand 
side with its plane of polarization defined by the dichroic polarizer as 
shown in the end view. The plane of polarization is rotated by 45° in the 
clockwise direction by the Faraday medium and passes through the 
right-hand polarizer. However, a wave entering the right-hand polarizer 
has its plane rotated 45° in the clockwise direction and hits the left-hand 
polarizer with its plane of polarization in the low transmittance or ab- 
sorbing direction of the polarizer. The direction of the axially applied 
magnetic field with respect to the forward direction must be chosen in 
accordance with the sign of the Verdet constant of the Faraday medium 
used. 

A TWOM may be built as shown in Fig. 4. In the figure the active 
medium is illuminated with pump power in one of several known ways. 
A two-section maser is shown; however, additional sections can be added. 


FORWARD DIRECTION 





Ke, 20 
A 
7 
7 
| 
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Fig. 3 — An optical Faraday rotation isolator. 
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Fig. 4 — Elements in a two-stage TWOM. 


For best operation, of course, all interfaces would have antireflection 
coatings. In the TWOM, alternate polarizers can be eliminated for 
simplicity. 

An obvious application of the TWOM is as a power amplifier for an 
optical maser oscillator. Another application may be to obtain very 
high peak optical powers. It is well known that because of its energy 
storage the peak power of a maser may greatly exceed its average power. 
In fact some of the pulsed characteristics of a TWM have been ana- 
-lyzed.® The interaction of high optical power levels with matter will 
certainly allow many new and interesting phenomena to be studied. A 
focused beam with these peak intensities should have application to 
microwelding or micro-cutting of materials. Finally the TWOM may 
find application.to amplify received signals in an optical relay system. 

This concludes the discussion of the general ideas relating to a uni- 
directional traveling-wave optical maser. The concepts apply to either 
a CW amplifier or a pulsed amplifier. It now remains to discuss in more 
detail the theory and design of the individual amplifying sections and 
the optical isolators which have been used and tested in a pulsed ruby 
TWOM. 


III. THE AMPLIFYING SECTION 


Consider two levels 1 and 2 as shown in Fig. 5 with populations N,; 
and Ne and degeneracies g; and g2 respectively. Also assume that the 
sample interacts at the transition 1 <2 2 with a beam incident in a di- 
rection 9 and ¢ having an energy density per unit frequency range per 
unit solid angle given by p(v,6,0,P). Here v is the frequency of the 
radiation, P is a vector which defines an independent state of polarization 
of the radiation and 6 and ¢ are the polar and azimuthal angles of a 
spherical coordinate system fixed in our sample. For this situation 
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Condon and Shortly’ give the net rate of emission for transitions 1 = 2 as 
d 0,P)v" \ 
7 (v,9,¢,P) = W(69,P) {eae (g.N2 — goNi) + logis} giv) (1) 


where W(6,¢,P) is the probability per unit time per unit solid angle of a 
spontaneous transition from 2 — 1 with emission in a direction 
6,e and polarization P. 
g(v) is the normalized line shape for the transition. 
N, and N»2 are the number of atoms per unit volume in levels 1 
and 2, and 
vy is the velocity of wave propagation in the medium. 
Equation (1) gives both the stimulated emission and the spontaneous 
emission per unit solid angle in a direction @ and gy. Equation (1) takes 
into account the anisotropy of the transition probability. 
In order to find the gain or loss through the medium in a direction 
6,¢ at the 1 @ 2 transition, the portion of (1) due to stimulated emission 
is integrated in the following manner, 


f eeuaee) 3 Pee 
po P(v,9,9,P) 0 y 


where po(v,6,y,P) is the incident energy density and 1(6,¢) is the length 
of the medium in the direction considered. Hence the gain in db is given 


by 


(giN2 — goNi)g(v) dt 


Gav (v,9,9,P) == (10 log eé) | WCayep) ott do (0 


rob) (any — saNs)o(0) 
where do is the free-space wavelength and e the dielectric constant at 


SS 


Z 
- 
= 


1 


Fig. 5 — Two typical energy levels. 
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Xo. For a Gaussian line shape, (approximately true for the ruby FR, line 
at room temperature), the gain is given by 


Gav(v,9,¢,P) = (10 log e) | W(ae,p) Me) rol(8,¢) (gN2 — ‘, = 
= (y ay i 
n — Vo 


where Ay is the linewidth and »% is the frequency at the center of the line. 
Using (2), the bandwidth B over which the gain is within 3 db of the 
peak gain is given by 


lo Gav max 
B == Bu 8 Gab max ~~ 3 (3) 
log 2 


where By, = Av (cf. equation (25) of Ref. 5). If the transition considered 
is isotropic (i.e. W(6,9,P) = W = constant) then the gain given by (2) 
becomes 


2 
Gan(v,0,¢,P) = (10 log e) [Maat | (w. — a y,) 2 In 2 2 
Saer g Ay vg 


(» — »)° 
‘exp — (at 41n2 
where 7 is the spontaneous emission lifetime for the transition and is 
equal to (8gi,WV). From (2) or this last expression it is observed that 
the medium exhibits gain if (g:N2 — g2Ni) > 0 and loss if (giN2 — g2N1) 
< 0. 

In the case of ruby at room temperature, inversion at the R, line 
(14,400 cm) is obtained by maser operation as shown in Fig. 6. Here 
we have lumped the blue and green bands as one level; this is of course 
an obvious oversimplification. Also the ground state is treated as a 
single level with degeneracy g, = 4. At room temperature this is a valid 
assumption since the linewidth of the R, and Rz transition is large com- 
pared to the separation of the two zero field levels in the ground state. 
At lower temperatures the linewidths of the R, and R, transitions are 
small, so that the two zero-field levels of the ground state are resolved; 
then it is necessary to consider the ground state as two levels each with 
degeneracy 2. For ruby the time spent in the pumping states is negligible 
compared with the normal lifetime of the two metastable states shown, 
and the efficiency for atoms getting to the metastable states upon 
pumping into the green or blue bands is near 100 per cent. Based on this, 
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and assuming that pumping power is supplied for times short compared 
to the normal or stimulated emission lifetimes of the two metastable 
states, we can write 

dN, 

—_ & —fPN 4 

aL FPN (4) 
where P is the incident pumping power and f is a pumping efficiency 
factor, which depends on the pump transition probability and the 
properties of the electromagnetic structure used in the pumping process. 
Since we assume that the pump power P is supplied for a short time, (4) 
can be integrated to obtain 


NM 


t 
N exp a Pdt 
0 


= Ne’ 


where N is the total number of Cr’* ions/cc in the ruby sample and F is 
the total energy absorbed by the material at the pumping transition. 
Now since it was assumed that the time spent in the pumping levels is 
negligible compared to the spontaneous lifetimes of the metastable 


GEE: BAND 


(5) 
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Fig. 6 — Optical energy levels of Cr** in Al,0; . 
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states, the following relation holds, 
NitN,+N3;2N (6) 


and NV; = yN2 where y is determined by rz , 743 and 723. Using (2), the 
power gain G of the ruby material at the center of the ruby R, line when 
inverted is ‘ 


Ga» = A LW -£ | (7) 
91 
where 
‘ a 
A = (10 log e) | ot oe,P) ret) 2 In 2 
€ Ap T 
Upon substitution of (5) and (6) into (7) we obtain 
Gav 1 | go ca 
———— = — — |1- <1 ve CI 
(eae eg ee (8) 


ip 
paar 


where (Lab) max is equal numerically to the loss in db at the center of the 
line if all ions are in the ground state. To a good approximation this is 
equal to the loss through the unpumped material for a temperature T 
such that kT’ < hyp, , which is the case for the ruby R, line for tempera- 
tures 300°K and lower. Since (8) is derived for the case where the gain 
is measured in a time 7T'meas after the pump power has been supplied, 
which is short compared to 72: and 73; but long compared to 743 and 742 , 
the value of y can be simply given for two limiting cases: 


(a) 723 < Ta and ta < T'meas 
y = exp A(ve, — ve,)/KT. 
At room temperature for ruby y = 0.865 
(b) ra and r43 < Teas < T23 


_ 742 


TA3 
At present no accurate data are available for ruby on the times 723 , 
t4. and 743 ; however, the measurements of Wieder’ give upper limits 
for these times which indicate that case (a) is satisfied at room tem- 
perature. Also the gain measurements to be presented are consistent 
with the assumption y = 0.865. 
To determine what values of fH could be achieved and what gains 
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could be obtained for a pulsed ruby amplifying stage, gain measure- 
ments were carried out on an amplifying test section at room tempera- 
ture. This test section consisted of a 3-inch long, 0.250-inch diameter, 
c-axis oriented (0.065 per cent by weight of Cr.03 in Al,O3) ruby rod 
and a G.E. FT-91 Xenon flash tube located at the foci of a 3-inch long 
elliptical cylinder, as shown in Tig. 7. The gains were measured as is 
shown schematically in Fig. 8. The beam from a ruby oscillator was 
used as the signal source, and the gain or loss through the amplifying 
section was determined by measuring the ratio of the signal at the photo- 
multiplier with and without the amplifier in the oscillator path. The 
measured loss through the unpumped amplifier at the peak of the R, 
line was 12 db. Because the ends of the ruby were not coated with anti- 
reflection layers, the measured gains had to be corrected to obtain the 
single-pass gain. 

The expression for the numerical gain G of a reciprocal amplifier with 
feedback is given by 


G (1 =< r)'Go (9) 


a) + Ge — 2rGo cos 9 
where r is the power reflection coefficient at each end and ¢ is the rela- 


tive phase shift of a wave making one complete, round-trip traversal of 
the amplifier. Since the length of the ruby amplifier is not known to an 
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Fig. 7 — Ruby optical amplifier section. 
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Fig. 8 — Schematic description of optical gain measurements. 





accuracy better than a wavelength, the value of ¢ to be used in (9) to 
obtain the single-pass gain Gp is not known. In the gain measurements an 
interference filter was used which had a passband larger than the ruby 
amplifying linewidth. Further, the measured bandwidth of a ruby 
oscillator is 1-2 kmc, whereas the separation of I'abry-Perot modes in 
our amplifying rod is 


Cc 
Av & o/Jal ~ 0.83 kme. 


Hence it is reasonable to correct the data by assuming the proper gain 
formula to use in determining G is one where G has been averaged over 
all possible values of v. For the case where G is averaged we obtain 


Ay Clee )*@o 

The experimental data corrected in this manner for the 3-inch long 
amplifying section are plotted in Fig. 9 along with the theoretical curve 
given by (8) for a = 0.865, g. = 4 and g, = 2. Also plotted are data 
obtained on a 1-inch long ruby where the measured gains were smaller 
and the regeneration corrections needed to obtain the single-pass gain 
were less important. It is seen from Tig. 9 that our measured gain varia- 
tion versus the input pumping parameter f/ is in good agreement with 
the theory developed. 

It remains to explain how the correlation between the values of fH 


on the abscissa of Tig. 9 and the measured light intensity at the green 
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and blue bands was obtained. For this purpose, the relative intensity at 
the green and blue bands versus electrical energy input to the flash lamp 
was measured photoelectrically. The absolute relation of this relative 
intensity to fH was then determined by observing at what value of light 
intensity unity gain (0 db) was achieved with the amplifier in place. 
The maximum measured net gains for the amplifiers tested depended 
rather critically on the 'T-91 tubes used; however, by selecting I'T-91 
flash tubes and operating them at input energies of 250 joules (which is 
approximately twice their rating) net gains of 10 db and 6 db were ob- 
tained respectively for two different amplifiers. All the measurements 
were made at room temperature. Higher gains per unit length could 
have easily been achieved by cooling, for it is known from the measure- 
ments of Schawlow and Devlin’ that the linewidth of the ruby R; line 
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Fig. 9 — Gain versus input pump energy curve for a pulsed ruby maser at 
room eeriperatine’ (Lap) max is 12 db and 4 db for the 3-inch and 1-inch ruby, re- 
spectively. 
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decreases quite rapidly with decreasing temperature between 300 and 
80°K. 


IV. THE OPTICAL ISOLATOR 


To complete the discussion of the individual elements in the TWOM, 
it remains to discuss the basic principles involved in the design of an 
optical isolator and to report some performance data. 

It is well known that the plane of polarization of a light beam, when 
passing through a I'araday medium, will be rotated by a magnetic field 
H applied to the material in a direction parallel to the direction of light 
propagation. The angular rotation @ of the plane of polarization is re- 
lated to the strength of the magnetic field and the path length J in the 
medium by the expression 


6=V-H-l (11) 


where the angle @ is chosen in the positive screw sense along the applied 
magnetic field. The constant of proportionality V is known as the Verdet 
constant and is a function of the material and the wavelength of light 
used. The Verdet constant is related to the material properties in the 
following manner 


V = ru (Ny — N_)/He 


where N and N_ are the refractive indices for right- and left-handed 
circularly polarized light of frequency »). In the case of ferromagnetics 
and antiferromagnetics, large Verdet constants have been measured. 
The large Verdet constant measured in each of these materials is due to 
the fact that the ions which produce the rotation see, in addition to the 
applied field, an internal field which is of the order of magnitude of the 
exchange field. Ferromagnetics would then be especially attractive. For 
the visible region of the optical spectrum, however, for ferromagnetics 
known to the authors, this large rotation is accompanied by large at- 
tenuation per unit length of material. This is because the absorption 
bands which give rise to the rotation in the ferromagnetic material usu- 
ally extend from the ultraviolet well into the visible. However, ferro- 
magnetic materials should be useful rotators in the infrared —i.e., be- 
yond about lu. 

The problem of building a good isolator in the visible (particularly at 
the R,; line) depends upon finding a material which has a Verdet constant 
large enough to produce 45° rotation with reasonable fields and lengths, 
and which has low attenuation in the length of material which must 
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be used. For the ruby R; line at 14,400 em™, known ferromagnetics are 
at present unsatisfactory. However, two diamagnetic materials appear 
to be good choices. One of these, ZnS(8), has a Verdet constant of 0.22 
minute/em/gauss at the R, line in ruby and is essentially transparent at 
this wavelength. or ZnS(8) a rotation of 45° requires a magnetic field 
of 3100 gauss for a 4-cm length. The second material, high-density PbO 
glass, also is attractive since it has a Verdet constant of 0.09 minute/ 
cm/gauss at the FR, line and an attenuation of 0.08 db/cm. Because of 
the commercial availability of the PbO glass from Corning Glass Works, 
this material was chosen for use in the optical isolator. In both the ZnS (8) 
and the PbO glass it is the Zn** and Pb*~ ions which are probably re- 
sponsible for the rotation. In the PbO glass the absorption band which 
produces the rotation rises sharply at 4000 A, and extends to shorter 
wavelengths; this band is probably composed of the So to °Po , °P1 , *P2 
transitions in the free-ion notation. Since the configuration which gives 
rise to *Po , *P: and *P, states in the free ion contains bonding electrons 
in the solid state, the transition observed beyond 4000 A should more 
properly be referred to as a charge transfer band. This tentative assign- 
ment seems supported by measurements made on Pb*~* in CaO by 
Ewles as discussed by McClure.” The Verdet constant and the at- 
tenuation per unit length have been measured for Corning #8363 high 
PbO content glass as a function of wavelength and are shown in Fig. 10. 
The data in Fig. 10 indicate that PbO glass is a useful Faraday rotator 
for optical isolators in the wavelength range 5000-7000 A. 

To define the plane of polarization in the isolator and to provide 
reverse loss, as was discussed earlier, dichroic polarizers such as Tour- 
maline and Polaroid sheet or crystal polarizers such as Nicol prisms and 
Glan-Thomson prisms can be used. At the R, line, type HN-38 Polaroid 
sheet is a good choice, having a transmittance of 0.86 for the parallel 
orientation and a transmittance of approximately 0.01 for the perpen- 
dicular orientation. 

An optical isolator was constructed in the manner as depicted in Fig. 
3. A 4-inch long, 34-inch diameter PbO glass rod was used as the rotator, 
and Type HN-38 Polaroid polarizers were used to define the plane of 
polarization and to provide reverse loss. A water-cooled solenoid provided 
the necessary field to produce 45° rotation. The reverse loss of the iso- 
lator was measured at the R, (14,400 cm™) line and was found to be 
approximately 16 db. The measured insertion loss of the isolator was 3 
db. In this isolator, optimized operation has not been realized; however, 
measurements on the individual components were made to determine 
the best performance that can be achieved with a more carefully con- 
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Fig. 10 — Variation of the Verdet constant and the internal loss of PbO glass 
versus wavelength. 


structed isolator. These measurements on the individual components 
indicate that an isolator with an insertion loss of 1 db and a reverse loss 
of 15 to 17 db could be realized if antireflection coatings are used on all 
isolator surfaces. 


V. OPTICAL TRAVELING-WAVE MASER PERFORMANCE 


To test the feasibility of a high-gain (80-60 db) optical amplifier a 
test TWOM was assembled, consisting of two amplifiers separated by an 
isolator section. The two individual amplifiers had net gains of 10 db 
and 6 db respectively, and the isolator had an insertion loss of 3 db. 
A net gain of 13 db was expected. The measured net gain was 12.2 db; 
the discrepancy of 0.8 was probably due to the lack of perfect align- 
ment. The oscillogram presentation of the gain of the amplifier is shown 
in Fig. 11. The top trace is the signal due to the oscillator alone as ob- 
served with the photomultiplier, with the TWOM out of the beam path. 
The middle trace shows the signal after amplification by the first am- 
plifying stage with the isolator and second amplifier out of the beam 
path. Taking into account the changes made in the gain of the oscillo- 
scope, the second trace indicates a net gain of 10 db through the first 
amplifier. The bottom trace shows the signal after passing through the 
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Fig. 11 — Oscilloscope photographs showing gain in the TWOM: (a) upper 
trace shows the signal at the photomultiplier due to the oscillator alone with the 
TWOM removed from the beam. Scope gain is 2 volts/em. (b) Middle trace is the 
received photomultiplier signal with the first amplifier in the beam. Scope gain is 
10 volts/em. (c) The lower trace shows the oscillator signal as amplified by the 
entire TWOM. Scope gain is 10 volts/em with 3 db of optical attenuation added. 
The sweep time is 100 microseconds/cm in all three traces. 
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entire TWOM. For this measurement 3 db of optical attenuation was 
introduced, and if this is taken into account the net Vea of the TWOM 
was found to be 12.2 db. 

In making all the gain measurements sonora. a 1 time constant was 
used so that individual oscillator spikes were not observed. A measure- 
ment of the gain with a shorter time constant did not contradict the 
measured peak gains observed (over the oscillator duration) with the 
longer time constant and also did not reveal any new spikes that were 
not present in the oscillator. In all the measurements the firing of the 
oscillator flash tube was delayed with respect to the firing of the amplifier 
pumping tubes; this was done so that the amplifier could build up to 
full gain before the probing signal was sent through the amplifier. In 
Fig. 11 this delay was approximately 400 microseconds. There was about 
a +50-microsecond jitter in firing the oscillator, and this accounts for 
the difference in position of the signal in the three traces shown in lig. 
11. The total physical length of the tess TWOM was approximately 20 
inches, with the isolator solenoid presently being the most space-con- 
suming element. A shorter physical length is possible with a more careful 
and sophisticated design. 

Inasmuch as the diameter of the ruby rods used in the amplifiers is 
0.250 inch, the TWOM that has been described is a multimode amplifier 
capable of handling approximately 10° spatial modes. The fact that it 
can support 108 spatial modes suggests that image information can be 
sent through the amplifier. That is to say, the TWOM can be considered 
to be a limited aperture, infinite focal length lens with gain. To show 
that an image could be sent through a TWOM and amplified, a projec- 
tion slide having on it a number of dark lines was placed before the input 
of the TWOM (in this case the second amplifier was removed for sim- 
plicity) and illuminated with the oscillator beam. The slide was then 
viewed with a lens and a camera as shown in I'ig. 12. In Fig. 13 are shown 
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Fig. 12 — Schematic description of the image amplification experiment. 
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Fig. 13 — Photographs showing image amplification in the TWOM: (a) This 
photograph shows the observed image of two lines on the object slide as observed 
with unity gain. (b) This photograph shows the image when observed with the 
TWOM at a gain of approximately 5 db. 


two pictures of this slide as taken through the TWOM. Only two bars 
on the slide were illuminated by the oscillator. The left photograph was 
made with the amplifier operated at unity gain, and the right photo- 
graph was made with an amplifier gain of approximately 5 db. It is seen 
that there is greater contrast in the amplified image than in the unam- 
plified image. A quantitative measurement of the photographic negative 
with a densitometer showed a gain of approximately 4 db in the amplified 
image over the unamplified image. Although admittedly the observed 
image amplification is quite crude, it is nevertheless felt that the image 
amplification properties of a TWOM have been shown and that this 
property of the TWOM may be of importance in a communication sys- 
tem. 


VI. IMAGE DEFINITION, CHANNEL CAPACITY AND NOISE 


The capacity of a communication system depends upon the total 
number of unit phase-space cells which the system can handle. The usual 
communication system has a spatial width of only one cell (it is single 
mode), and information is conveyed only in the longitudinal direction — 
i.e., the frequency and time domain. At optical frequencies however, 
it may be convenient to use many cells in transverse space. A discussion 
is therefore given of the channel capacity of a multimode image-am- 
plifying maser. 

We define channel capacity as the product of the number of cells in 
transverse space and the bandwidth. The latter has been discussed 
earlier; it remains to consider the former. 

Evidently if we are dealing with an infinitely long cylindrical amplifier 
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with perfectly reflecting walls —i.e., a multimode waveguide with gain 
— then the number of modes for a unidirectional, single-polarization 
amplifier is tA /\’, where A is the cross-sectional area and the channel 
capacity Cis | 


where the bandwidth B is given by (3) for a Gaussian line shape. 

In practice, the amplifier is not infinitely long, and in fact the device 
described here has a length < (A/d). Under these conditions one is 
concerned with the near field region of the amplifier aperture, and image 
transmission is possible (i.e., some modes can be transmitted from input 
to output independently of the boundary conditions). 

Consider an idealized, unidirectional, single-polarization, cylindrical 
amplifier as depicted in Fig. 14 of length / and cross sectional area A. 

We consider first the image space. A transmitted image can occupy in 
transverse (i.e., v,y,P;, Py) space an area equal to AQ where Q is the 
angular aperture A/l’ and P = (hAv/c) is the photon momentum. The 
aperture A and its associated diffraction solid angle w = \°/A define 
the transverse dimensions of a unit cell.” Consequently the total num- 
ber of image cells available is 

Q A® 

, Bat = mi Bat (12) 
where At is the time over which the observation is made. Equation (12) 
may be interpreted to mean: (7) that Q/w identifiable image points can 
be amplified (this also follows from classical diffraction considerations ) 
and (iz) that the image channel capacity AC = (Q/w)B. The device 
whose operation has been described here has a AC % 3 X 10" cps. 

One can classify cells into groups according to the behavior of their 
corresponding rays of geometric optics. Those used for transmitting an 
image consist of A’/l’d" rays which pass directly from the input to the 
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Fig. 14 — An idealized image amplifier. 
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output. The next group consists of 3(4/IX”) rays which undergo a single 
internal reflection at the wall before reaching the output and fall within 
the solid angle 42 minus the image core 2. Successive groups undergo an 
increasing number of internal reflections. The reason for the A* depend- 
ence in (12) is now clear — the area and consequently the total number 
of cells which the transmission line can support is proportional to A, 
and out of this total number, that fraction which can pass directly from 
input to output is also proportional to A. If the length of the amplifier 
is greater than A/), then all rays are reflected from the walls and, in 
this limit, one is concerned with waveguide propagation. 

In principle, and to a certain extent in practice, the “off axis’ rays are 
available for transmitting information. The internal reflections will 
result in image distortion, but cylindrical symmetry is preserved and 
spatial coding can be used. At the present time, however, the channel 
capacity using only the image core is embarrassingly large and there 
seems little advantage in using the “off axis” rays. 

The limiting noise of a maser (or for that matter any coherent ampli- 
fier ) corresponds to one noise photon per unit cell of phase space, (noise 
number” N = 1). Optical masers can come very close to this limit and 
in fact the ruby amplifier described in this paper has a calculated N ~ 2. 
The number of noise photons associated with the signal is consequently 
N AC. 

In the frequency domain it is well known that there is an optimum 
bandwidth for a given signal and that excess bandwidth gives excess 
noise. Similarly in the transverse domain there is an optimum spatial 
bandwidth; in particular the angular aperture 2 used should not exceed 
that required for the necessary definition, as this would add excess noise. 
If the amplifier has excess angular aperture, the optics should be ar- 
ranged so that the transmitted image and the following receiver occupy 
only the necessary solid angle. 

In practice, additional noise is produced at the output, and proper pre- 
cautions should be taken to eliminate this excess noise. Consider first 
only the noise associated with image rays. A cross section along the 
amplifier, shown dotted in Fig. 14, may be considered as the input of the 
remaining part of the amplifier with a new angular aperture Q’ = A’*/I/ 
which is larger than Q. We have therefore an amplifier whose bandwidth 
(spatial) increases (to a limit Q’ = 27) as we travel along the device. 
Iixcess noise will appear at the output in this larger solid angle but will 
of course see less gain than that in the useful angle Q. A detector, insensi- 
tive to angle, placed at the output will see this noise, which can be se- 
vere in some designs. This excess noise, which is not intrinsic to the 
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signal band, can perhaps most easily be eliminated by focussing the 
input plane on the detector and using an aperture stop which passes only 
the focused image. 

Iiven though one may not intend to use the “off axis” rays they may, 
unless special precautions are taken, be available at the output. Com- 
monly used solid-state maser materials have large indices of refraction 
and, consequently, internal reflections may occur for large angles of in- 
cidence (for ruby the critical angle is 60°). It is evident, therefore, 
that an appreciable fraction of the spontaneously emitted photons can 
be amplified and reach the output. These can be separated from those 
in the signal image core by the method mentioned above. 

One should differentiate between presently available image intensifiers 
and maser image amplifiers. Unlike the maser, the former are not co- 
herent amplifiers and are perhaps better described as image quantum 
detectors. The quantum detector preserves the intensity of the signal 
but not its phase (and in practice it is difficult to preserve the momen- 
tum). The maser preserves both amplitude and phase and easily pre- 
serves the frequency and direction of the signal. It is a direct consequence 
of uncertainty that a coherent amplifier has a noise number N 2 1. 
The detector which preserves only one variable can in principle be noise 
free, although this does not mean that it is more ‘sensitive’ than a 
maser in some systems applications.” 

At wavelengths of < ly the black body temperature of an object 
must be >10*°K before the phase-space density can exceed unity. As a 
consequence, the phase of the radiation is indeterminate for objects 
illuminated by usual incoherent light sources. There appears little object 
in using a coherent amplifier with such an image, and in fact its noise 
can be a serious disadvantage. 

If an object isilluminated by a maser oscillator, however, then densities 
>1 can be obtained, and in fact densities >1 are necessary in a com- 
munication system in order to make full use of channel capacity. In such 
systems a coherent image amplifier which can also easily preserve the 
frequency, phase and direction of a signal may be of value. 


VII. PUMP POWER REQUIREMENTS 


One of the principal practical difficulties associated with optical 
masers is that of providing the pump. Minimum pump power and mini- 
mum pump color temperature are mutually exclusive, and in practice 
a compromise must be reached. In this section we are concerned with 
the effects of signal circuit design on the pump requirement of a CW 
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amplifier. It is clear that with the use of materials whose quantum effi- 
ciency is near unity (ruby has a Q.E. 0.9) pump photons are required 
only to replace those photons dissipated at the signal transition. In 
practice, signal circuit design can affect the pump requirements by sev- 
eral orders of magnitude. 

The total number of photons emitted at the signal transition from an 
uncertainty-limited (V = 1) maser amplifier is 


é Go —I1n é 
Ne = (RT + wD), (13) 
Here Go is the numerical gain for a given cell and D the dynamic range 
(equal to the maximum number of photons per cell in the input signal). 
n is the density of particles per unit volume per spectral interval, and 
from the previous equations n/r is related to Go. The first term is due to 
spontaneous emission and amplified spontaneous emission, and the 
second is due to the amplified signal. Equation (13) may conveniently 
be regrouped as 


3 3 
c G—l1n c G—l1n 
= > ——— ——____ — D > =. ola 
Na AN Signal (2 In G or a ) + The | Srv? In Gy + (14) 
ells AC Remainder 








The first term is determined entirely by the specifications placed on the 
amplifier; i.c., it has a channel capacity AC, a gain Gy , and a dynamic 
range D over this band. The first part accounts for the noise from an 
uncertainty-limited amplifier. The second part is due to the signal. When 
Go&D > 1, the first term becomes GoACD. The second term is the sum, 
over all other cells into which the particles can radiate, of the spon- 
taneously emitted and amplified spontaneously emitted photons. This 
term, which is concerned with cells outside the signal band, represents 
wasted energy. If AC contains only a small fraction of the total number 
of cells, this term may be approximated as K(N/r) where N is the total 
number of particles at the signal transition and K is the average of 
(Go — 1)/(in Go). Evidently if the gain for these excess cells is unity, 
then the second term is just N/7, the total spontaneous emission. 

In a practical amplifier it seems likely that the specifications and the 
design will be such that (14) will be represented to a good approximation 


by 


Certainly it is likely that a Go&D > 1 will be required and hence the 
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first part is a good approximation. If Ga&D > 1, then an insignificant 
amount of pump power will be saved by designing so that AC occupics 
an appreciable fraction of the total phase space and further, since such 
a design would appreciably increase the pump color temperature 
requirements, it seems likely that the fraction will be kept small. Under 
these circumstances, the second part is also a good approximation. A 
signal circuit efficiency 7, may then be defined as 


n> = GACD/(GACD + K N/r). (16) 


The preceding discussion is now applied to a single-mode transmission 
system —i.e., one where the information is to be sent only in the 
longitudinal domain. An obvious design for anamplifier in this application 
might be a single-mode, single-polarization waveguide TWM using fibre 
optics. A material with a quantum efficiency near unity, an inversion 
near 100 per cent, and a signal linewidth just sufficient to give the re- 
quired bandwidth might be used. As far as the material and signal struc- 
ture is concerned, such a design has 100 per cent photon efficiency; i.e., 
every photon absorbed from the pump is available for amplification at 
the signal frequency and in the signal mode. (To be strictly correct, the 
efficiency is 25 per cent if it isa single-polarization, unidirectional ampli- 
fier.) The ratio of stimulated to spontaneous emission is & [(Go — 1)/ 
In Go| for high gain in the absence of a signal. If the requirement was for 
a gain of 30 db then this ratio is +20 db. Further since one-half of the 
stimulated emission photons will be produced in the last 3 db (10 per 
cent of the length) of the amplifier, the rate at which photons are pro- 
duced at the output section is 10° times that from spontaneous emis- 
sion alone. This means that the idler relaxation time must be <10~ of 
the spontaneous emission lifetime and that the pump density must be 
10° greater than that required to produce the same inversion in the 
absence of gain. So far we have considered only the contribution from 
amplified noise. If in addition we require that the device be able to 
amplify a signal D X noise, the above figures of 10-° and 10° become 
10° and 10°, respectively, for D = 20 db. These perhaps impossible 
requirements on the idler relaxation time and pump color temperature 
have arisen because the device is too efficient. 

A less efficient alternative design is now considered for the above ap- 
plication. Consider an image amplifier of the type depicted in Tig. 14. 
Since we are concerned with only one transverse mode, we need the 
definition for only one image point —i.e., 1 2 A/). 

Equation (15) is a good approximation in this case. The first term, 
which represents essential energy, is 
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Gap max 


GoACD —= GoDAv log Gay max 3 . 
log 2 


The second term, which represents wasted energy, is given by 


N 


Ko Sk Ss wohime = KEN 
T T T 


Also, if JV is isotropic and we have complete inversion, (2) gives 


= GaveAv 927 
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and hence 


N _ Gave Av” (g2\" 7 
KX = x x fares (@) 5 
As in the last example we assume the amplifier is required to have a 
gain of 30 db and a dynamic range of 20 db. If 0.05 per cent ruby is used 
at room temperature, if complete inversion is obtained and if further we 
assume that, contrary to the device described in this paper, the isolators 
use a negligible fraction of the total length, then 


AC = 10” eps 


giving 
G ACD = 10°° photons per sec 
also 
LS 16cm 
giving 


K = ~~ 0.85 X 10” K photons per sec. 
T 


Irom (16) the structure efficiency is 


1 4 
se s12X10-. 
ee 
Since the signal is capable of interacting with almost all of the spins, 
the signal photon density has negligible effect upon the required pump 
color temperature with this value for 7, . It is essential to keep K small 
since it has a direct effect on the required color temperature as well as 
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on the required pump power. The index of refraction of most solid- 
state maser materials is so large that unless special precautions are 
taken to prevent internal reflections, a large fraction of the off-axis rays 
will see the full amplifier gain. This could easily lead to a value for 
K 100 for a 30-db amplifier with a corresponding two order-of-magni- 
tude increase in the required pump power and energy density. An esti- 
mate indicates that it should be possible to keep K < 2 with correspond- 
ingly little effect on the power and density of the pump. 

The two amplifiers just described represent design extremes. The 
single-mode waveguide amplifier has a minimum pump power but an 
excessively high pump density requirement (in this example 5 orders of 
magnitude above that for spontaneous emission alone). Conversely the 
single-cell image amplifier has a minimum pump density but an excessive 
pump power requirement (in this example 4 orders of magnitude above 
that intrinsic to the specifications). A reasonable compromise objective 
might be for 7, & 50 per cent, in which case both the power and density 
requirements would both be increased only by a factor of 2 from the 
minimum. 

The single-cell image amplifier just described is very inefficient because 
only a small fraction of the total phase space is available to the useful 
image core. A somewhat different geometry can improve this. As an 
example, one could break up the amplifier into say 10 sections, each with 
a gain of 3 db. Each section is now capable of amplifying one image 
point if 1/10 ~ A/d;i.e., A and hence K(N/7r) can be reduced a factor 
of 10 with a corresponding increase in y,. Of course with this decrease 
in A, the diffraction angle is larger and unity magnification lenses must 
be placed between sections to reform the image at the input plane of 
the next section. The extent to which this subdivision can be continued 
is In practice limited by the fact that electronic gain per section decreases 
with the number of sections while the circuit loss per section will be 
substantially constant, leading to an eventual decrease in the net gain 
of the entire device. 

This section has been concerned primarily with amplifiers for use in 
a single-mode transmission system, and the information has been con- 
veyed in the longitudinal domain. It has been shown that pump density 
and power can vary over wide ranges by signal circuit design. In prin- 
ciple, analogous arguments apply to an image amplifier where informa- 
tion is to be conveyed primarily in the transverse domain: in practice, 
the problem is primarily one of increasing the efficiency and would ap- 
pear to depend largely on obtaining narrow linewidth materials. Narrow- 
frequency band filters are probably also necessary to obtain high effh- 
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ciency. It is perhaps worthwhile noting that filters of the Iabry-Perot 
type are narrow in entire momentum space and as such inhibit image 
transmission. 


VIII. RECIRCULATION 


Since the amplifiers described here are traveling-wave devices, they 
may be folded to achieve a more desirable geometry. An extreme case 
of folding is to send successive signal passes through what is otherwise 
a single section; i.e., an amplifier is designed so that its transverse ca- 
pacity exceeds that required for the signal, which may then be recircu- 
lated in this otherwise excess image space. 

If, for example, one is concerned, as in the last section, with an ampli- 
fier for a single-mode transmission system, then each pass must occupy 
a region Ax Ay AP, AP, = h’. Further it is essential to make efficient use 
of this transverse space if 7, is not to deteriorate. It may be necessary to 
provide image guard bands to inhibit feedback: the resulting decrease 
in 7; May in practice be more than compensated by an increase in the 
pump circuit efficiency with this geometry. 

The recirculating optics can take many forms. Rectilinear reflections 
ean produce the simple single-cell image amplifier described in the last 
section in compact form. The addition of lenses can produce the folded 
equivalent of the third type of amplifier described in the last section. 
If a Faraday rotation isolator is used, an optically active material such 
as quartz in the recirculating path can restore the polarization. The 
number of passes is limited by the losses in the optics and by the amount 
of feedback which can be tolerated. 


IX. TRANSIENT BEHAVIOR 


The fact that a TWM can store energy and release it on demand is 
well known. This property has been of only academic interest at micro- 
wave frequencies but may be one of the most important characteristics 
of an optical device. The transient behavior of a TWM has been in- 
vestigated by Schulz-DuBois* and may be summarized as follows. If a 
strong signal in the form of a step function is applied, then the leading 
edge will see the full amplifier gain and, in the process, will drain off 
active particles so that successive portions of the signal experience less 
gain. As the signal travels along the amplifier, it assumes somewhat the 
shape of a shock wave where the sharp leading edge sees the full gain, 
and the integrated energy in the pulse is essentially equal to the original 
stored energy in the preceding portion of the amplifier. The output pulse 
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width is limited by the rise time of the amplifier. This theory is valid 
as long as the material characteristics at the signal frequency can be 
described in terms of the rate equations. 

The rise time of a ruby TWOM at room temperature is 2 X 10-” 
sec. By using the pulse-sharpening property of the device, it may be 
possible to produce light pulses shorter than any available by conven- 
tional electronics. 

It is easy to design a TWOM of cross section less than 1 cm? whose 
stored energy is greater than one joule. With such a device, rise time is 
unlikely to limit the peak power; instead, the limitation will probably 
be material breakdown. In the present design the Polaroid is believed 
to be the most susceptible component but could be replaced by, say, 
Glan-Thomson or Brewster angle polarizers. We expect that peak powers 
of 10’ watts/cm? can be achieved quite easily without focusing and that 
considerably higher power densities are realizable. 

Once the limiting power density and cross-scztional area are reached, 
the beam can be divided into parallel channels forming a phased array. 
If phase stabilities approaching that of the microwave TWM can be 
achieved, several divisions will be possible. 


xX. CONCLUSION 


The feasibility of an image-amplifying optical traveling wave maser 
has been demonstrated. Since each section is short circuit stable, they 
may be cascaded to produce any desired gain. 

The experimental data on the gain as a function of pump power are 
in good agreement with theory. 

The loss and Verdet constant of high density PbO glass has been 
measured and is such that in combination with dichroic polarizers it leads 
to a satisfactory isolator. Materials with higher Verdet constants would 
be preferable. 

The theory shows that, in order to build an amplifier which is low 
noise and which has a minimum pump requirement, one must use high 
quality optics which are diffraction limited. 

It is believed that very high peak powers and very short pulses can 
be produced with the device. 

An effective channel capacity ~10" cps has been obtained and one 
is faced with the rather unusual problem of reducing the bandwidth 
of an amplifier. 
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Further Analysis of Errors Reported in 
“Capabilities of the Telephone Network 


for Data Transmission” 
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The recorded error data from a field testing program reported on by Alex- 
ander, Gryb, and Nast have been further analyzed. New methods of analysis 
have given more information on the causes and nature of errors experienced 
by data in the switched telephone plant. The results obtained will enable 
workers in the field of error control to use the field test data more effectively. 
The ideas presented will be useful to designers of future field tests. 
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I. INTRODUCTION 


The purpose of this paper is to describe certain characteristics of 
errors affecting digital data in the switched telephone network. The 
present results are based on an analysis of error data recorded in a field 
testing program conducted by the Data Transmission valuation Task 
Torce of the Bell System. These data are summarized in a previous 
paper by Alexander, Gryb, and Nast! which sets forth numerous con- 
clusions about the telephone plant, based on the test program. The 
present paper reports the results of a sequence of statistical investiga- 
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tions which have made it possible to classify most of the errors into 
several populations. The effect of the test equipment on the time-dis- 
tribution of errors is also discussed. 


II. THE TASK FORCE TESTING PROGRAM 


This section contains a description of those aspects of the test pro- 
gram which bear on the present discussion. Test calls were made by the 
Task Force over a wide variety of circuits. Both local-and long distance 
calls were made over the switched telephone network and the perform- 
ance of the circuits was recorded. Data concerning 1010 test calls were 
gathered and used for analysis. The test calls were of three categories: 
10-minute calls at 600 bits/second, 10-minute calls at 1200 bits/second, 
and 30-minute calls at 1200 bits/second. Each of these categories was 
further subdivided into ‘Exchange Calls” which used no long distance 
switching facilities, “Short Haul Long Distance Calls” which were made 
over distances of less than 400 miles, and “Long Haul Long Distance 
Calls” which were made over distances of between 400 and 3000 miles. 

The source of the transmitted data was a word generator which pro- 
duced a sequence of marks and spaces repeating with a period of thirty 
bits. This word generator was used to drive an FM modulator. (The 
terms ‘‘mark” and ‘‘space”’ designate the two states of the FM channel. 
The convention here is that the lower of the two frequencies used is 
called mark and the higher is called space.) At the receiving end of the 
circuit, the FM signal was demodulated and compared with the output 
of another word generator identical to that at the transmitting end. 
When the output of the word generator at the receiving end agreed with 
the received signal, the bit was accepted as correct; when they differed, 
an error was noted. The two generators were kept in step by electronic 
clocks. The sequence of marks and spaces produced by the word gen- 
erators was as follows: 


SSSSMSSSSMMSMSSMMMMSMMMMSSMSSM. 


The sequence of bits received correctly and bits received in error was 
recorded serially on a magnetic tape with two channels, as follows: The 
first channel contained only clock pulses, one pulse per bit. The second 
channel contained a pulse opposite the corresponding pulse in the first 
channel every time an error was noted, and was blank whenever no error 
occurred. The relative bit phase of the word generator, 1.e., the position 
in the 30-bit word where the call began, was not recorded for any of 
the calls. The methods described below made it possible in most cases 
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to determine whether a particular error changed a mark into a space or 
a space into a mark, and to determine which, if any, of the thirty posi- 
tions in the test-generator word were most susceptible to errors. 

The demodulator contained what is known as a zero-crossing detector, 
which counted the number of times the received signal crossed a neigh- 
borhood of zero. This detector delivered an output which was propor- 
tional to the number of zero crossings per unit time. A threshold was 
established and when fewer zero crossings were detected, a mark was 
scored; when more zero crossings were detected, a space was scored. 
Thus when no signal at all was received, the absence of signal was in- 
terpreted as a mark. 

With a detector of this type, impulse noise would be more likely to 
add zero crossings to the received signal than to subtract them if the 
impulse noise were of higher frequency than the signal frequencies used 
in the tests. On the other hand, disturbances of lower frequency would 
tend to change spaces into marks by subtracting zero crossings. As it 
turned out, most errors caused by noise were mark-to-space errors, but, 
the fact most important to our methods of analysis was that a single 
type of disturbance during a call caused one of the two types of error 
predominately. By a “single type of disturbance” we mean any statis- 
tically recognizable pattern of errors. 

Much of the remainder of the discussion of this test program will de- 
pend on three of the properties of the system mentioned above, namely: 

(1) a cyclic 80-bit word generator was used, 

(2) any particular kind of disturbance on the line caused errors asym- 
metrically, i.e., either a majority of mark-to-space errors or a 
majority of space-to-mark errors, 

(3) loss of signal caused all bits to be received as marks. 

During some of the calls in the test program, the circuit was lost and 

a new attempt was made. On other occasions, the clock which kept 
the receiving word generator in phase got out of step, sometimes because 
of loss of signal. In either case, the error tape was erased and a new 
error recording was started from the beginning. Some of the calls en- 
countered one or both of the above difficulties. Only the calls which were 
successfully completed were reported. 


HI. THE EFFECT OF DROPOUTS 


Rather early in the study it became obvious that, in some of the test 
calls, errors were caused not by noise but by loss or serious attenuation 
(“‘dropout’’) of the received signal, often for very short periods of time. 
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When this occurred, all bits were received as marks and errors were re- 
corded only when the receiving word gencrator produced a space. When 
the receiving word generator produced a mark, the bit was recorded as 
being received correctly even though the mark that was received was 
completely independent of what was transmitted. The pattern of errors 
recorded during such a period was: 


111101111001011000010000110110, 


where a 0 stands for a bit accepted as correct, and a 1 stands for a re- 
corded error. Thus, when the line was open or for some other reason no 
signal was being received during the tests, 16 errors were recorded for 
every 30 bits transmitted. The error rate observed during a dropout is 
evidently dependent on the proportion of spaces transmitted, which in 
many systems of error control] differs widely from 16 out of 30; for in- 
stance, in the so-called 2/8 code, 2 marks and 6 spaces are sent in every 
block of 8 bits. 

Computer programs were written by the author which could detect 
the pattern shown above in those cases where the dropout extended 
over more than about twelve bits. A dropout occurring in the first nine 
bit-positions of the test word would have caused eight errors, and, since 
the computer program was adjusted so as to find dropouts causing more 
than five errors, this dropout would be found. A dropout which covered 
the sixteenth through the twenty-fourth bit-positions would not be 
found, since it would have caused only one error and could not be dis- 
tinguished from other error-producing effects. Because of the distinctive 
pattern of errors, the bit phase of the word generator could easily be 
determined in calls with long dropouts. 


IV. THE EFFECTS OF NOISE 


During some test calls, the nature of the disturbances on the trans- 
mission facilities was such that there were many more mark-to-space 
errors than space-to-mark errors. On other test calls, the reverse was 
true. A computer program was written by the author which made it 
possible to determine which calls were of these two types and to decide 
the bit phase of the word generator in those cases. The exact methods 
by which this was done are described in Section V. 

In some calls, there were ten to twenty times as many mark-to-space 
errors as there were space-to-mark errors. In these calls, errors did not 
fall on all of the mark positions with even approximately equal frequency. 
Some mark positions were more vulnerable to errors than others, al- 
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though the vulnerable positions were not always the same in different 
calls. In certain calls, the average number of errors on the four exposed 
mark positions (a single mark between spaces) exceeded by a factor of 
ten the average number of errors on the remaining mark positions. 


V. METHODS OF DETERMINING BIT PHASE 


Three different methods were used by the author to determine the bit 
phase of the word generator in the test calls. None of these was effective 
on calls with less than about fifty errors; in fact they occasionally failed 
on calls with as many as several hundred errors. However, the bit. phase 
was determined for enough calls to account for somewhat more than 65 
per cent of all the errors reported in the test program. 

The first method was by far the simplest. The error data were visually 
scanned for the characteristic pattern of a dropout. In only a few calls 
were there long enough dropouts for this method to produce results. 

The second method used the IBM 7090 computer to divide the se- 
quence of good bits and bits in error into consecutive blocks of thirty 
bits each. The positions in these blocks were numbered from 1 to 30 and 
the number of errors corresponding to each numbered position was 
totaled over all of the blocks in the call. This produced a sequence of 
thirty numbers such as the one in Fig. 1, which is the result actually 
obtained for one of the test calls, Call No. 2330; this was a 30-minute 
Exchange Call at 1200 bits/second. This call had a total of 678 errors, 
and if the errors fell randomly at each bit position, then more than half 
of the thirty numbers in [ig. 1 would be expected to lie in the range 
19-26 inclusive. This is far from being the case, since only three of them 
do so. About half of the numbers are surprisingly small and most of the 
rest are surprisingly large. There is a wide gap between the two collec- 
tions of numbers. When the relative bit phase of the word generator in 
this call is assumed to be that of Fig. 2, which is correct, the numbers 
of errors corresponding to the fourteen bit-positions of the test generator 
word which are marks are: 


25, 31, 20, 85, 27, 40, 28, 79, 46, 55, 43, 22, 66, 44 


and the numbers of errors corresponding to the sixteen bit-positions 
which are spaces are: 
9,1, 11, 2, 3, 8, 2, 2, 6, 4, 7,1, 1, 6, 4, 0. 


It is difficult to ignore the fact that this choice of the relative bit phase 
of the word generator divides the thirty numbers in Fig. 1 into two dis- 
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25 


25 


31 20 85 1 27 40 28 79 11 2 46 3 8 55 2 2 6 4 48 7 1°21 «6 22 


Fig. 1 — Output of computer program for call No. 2330. 
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Fig. 2 — Correct choice of word generator phase. 
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Fig. 3 — Incorrect choice of word generator phase. 
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tinct populations; the smallest number in the first list is 20 and the largest 
number in the second list is 11. Another (but erroneous) choice of the 
bit phase of the word generator is represented in Fig. 3. With this choice, 
the numbers of errors corresponding to marks are: 


9, 1, 11, 2, 3, 2, 2, 6, 4, 7, 1, 1, 6, 4 
and the numbers of errors corresponding to spaces are: 
25, 31, 20, 85, 27, 40, 28, 79, 46, 8, 55, 48, 22, 66, 44, 0. 


Two of the numbers in the second list (the 8 and the 0) seem to stand 
out as belonging more naturally to the first list. 

Because of the pattern in Tig. 1 of four large numbers, then one small, 
then four large, followed after an interval of six bits by four small, then 
one large, then four small, the only choices of word-generator phase 
which make sense are those shown in Figs. 2 and 3. The choice between 
them is not difficult in this case. This pattern is repeated in dozens of 
calls; often it shows up even more clearly than in the example given 
here. 

Calls in which there were numerous dropouts too short to be recog- 
nized by eye had an excess of space-to-mark errors. These calls were 
found and analyzed by the same method, but in this case high numbers 
corresponded to spaces. 

These two methods failed to determine the bit phase in many calls 
which had very large numbers of errors. It was suspected that some of 
these calls had both short dropouts and, in addition, enough mark-to- 
space errors so that neither of the two kinds of error was in large excess. 
To handle this situation, a computer program was written which pro- 
duced sequences of thirty numbers as above, but this time the first line 
included only the single errors, the second line included only the double 
errors, etc. Two examples of this presentation are given in ligs. 4 and 5. 
Now, errors caused by noise most commonly occurred one by one, and 
errors during a dropout occurred primarily in two’s and four’s as can be 
seen by reference to the test-generator word. Therefore the first line of 
this presentation could be scanned for any great preponderance of mark- 
to-space errors caused by noise, and the other lines could be scanned 
for evidence of short dropouts, which were also made more visible by 
this presentation. Many of the remaining calls succumbed to this double- 
barrelled attack. Fig. 4 gives an example of a call in which the errors 
were mostly due to dropouts, and Fig. 5 gives an example of a call in 
which the errors were mostly isolated mark-to-space errors. 


es Ga: a as | A ae Ee a | a 
000116002 2 1000M 10000100018 03 0 1@ 1 2 
0 210 00 00600 i100 0 2 0060 000 0 006000 10 0 0 0 0 
®oo000@00000000000000000000000i21 
100 00000000000 0000000600002 00 00 0 
000 00 10 00 060 00 0 0 0 00 000 000 00 0 0 0 0 0 
000 00 000 00 000 00600 0 00 00600 0 00 0 0 0 0 0 
00000 000 00 0 00 0 00 060 06000 000 00 0600 0 0 0 
200 00 0000 00000000 0600600006000 00 00 0 0 
Ss Ss SMS SS SMMSMSSMMMMSMMMMS SMS SM 


Fig. 4 — Output of second computer program for call No. 2249. The top line of this presentation contains the total number 
of isolated errors corresponding to each bit-position in the test word. The second line contains the total number of double errors 
corresponding to each position, with an entry made only once for each double error at the first of the two bit-positions it covers. 
That is, a double error on the first and second bit-positions would contribute 1 to the count in the first entry of the second line. 
Third and following lines are similar displays of triple errors, etc. In this, the circled entries constitute strong evidence of drop- 
outs. 
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23 2 131 0 4 1 021 6 024 0117 7 411 011 7 4 6 0 022 0 1 26 
00022000151010061 451129 12000000 21 
000100001200000110005100002120021 
0000000001001003200000000080%0 00 0 0 
S§S8SSMSSSSMMSMSSMMMMSMMMMSSMS SM 


Fig. 5 — Output of second computer program for call No. 1607. In this call there is no evidence of 
any dropout, but the first line indicates many more isolated errors on the marks than on the spaces. 
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VI. METHODS OF FINDING SHORT DROPOUTS 


Even though the last method mentioned was able to reveal short 
dropouts in a call, it gave no indication of their exact lengths or where 
they were located in the call. Another computer program was written 
which went bit-by-bit through each call of known bit phase, and counted 
up how many marks had been received out of the previous thirty bits. 
If no errors were recorded in the previous thirty bits this number was 
at all times 14. If mark-to-space errors were occurring, the number was 
less than 14. But, if a dropout occurred, even over a few bits, the number 
rose dramatically and made it clear where the dropout probably began 
and ended. By this method, combined with visual inspection of the 
original data, every dropout which caused more than five errors in any 
call of the test program was found, except for the remote possibility 
that during a dropout noise simulated the transmitted word. 

Another interesting phenomenon was discovered by means of these 
computations; namely, that often there was a dropout during which a 
few spaces would be received, but there was no correlation between 
these spaces and the spaces actually sent by the transmitter. This in- 
dicates that on occasion, even when the line was open, there was enough 
noise received to simulate a space, so that some bits which were recorded 
as correct in the field trials were actually the result of two potential 
errors in the same bit-position whose effects cancelled. Short periods of 
time during which the transmitted signal was absent, but during which 
impulse noise caused several spaces to be received, are not classified as 
dropouts, since the resulting error patterns are not distinctive. 


VII. ANALYSIS OF THE DROPOUTS 


In the test calls, a total of 58 dropouts extended over more than 12 
bits, and these were analyzed. Of these, none were in Exchange Calls, 
3 were in Short Haul Calls, contributing 110 errors, and the remaining 
55 were in Long Haul Calls, contributing 3400 errors. The longest drop- 
out was 1129 bits long (approximately 1 second). It is estimated that 
dropouts lasting 12 bits or less contributed at least 1500 additional 
errors. 

In addition to those discussed above, many dropouts occurred during 
the tests and were not reported because these dropouts caused a loss of 
synchronization between the transmitting and receiving word genera- 
tors or were long enough to cause the test personnel to terminate the 
call. The recording of errors in these cases was abandoned and a new re- 
cording started. 
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It is difficult, therefore, on the basis of the reported data, to specify 
quantitatively the contribution of dropouts to the total error rate. This 
contribution was negligible in the case of Exchange Calls and Short 
Haul Calls. In Long Haul Calls, dropouts were a major source of error. 
Even after the omission of records of many dropouts, as explained above, 
about 10 per cent of all Long Haul Calls had one or more dropouts, and 
these dropouts contributed over 20 per cent of the crrors in Long Haul 
Calls. 

Since there were no dropouts in the final data on Exchange Calls, 
and only negligible ones for Short Haul Calls, and since a very large 
number of calls were made in these categories, it seems safe to assume 
that the causes for dropouts lay in long-distance transmission or switch- 
ing facilities, or at least that these facilities formed an essential link in 
the chain of causes leading to a dropout. 


VIII. ANALYSIS OF ERRORS DUE TO NOISE 


There were 36 calls in which the bit phase of the word generator was 
established and in which there were no errors that could be attributed 
to dropouts. In these calls, the number of mark-to-space errors was 
slightly more than four times as great as the number of space-to-mark 
errors. This statement, though true, is somewhat misleading, because 
this was just the kind of call in which the bit phase was easy to deter- 
mine. It is certain that none of the remaining calls with more than about 
50 errors has any such discrepancy between the two kinds of error, or 
else its bit phase would have been determined. What can be deduced is 
that some transmission paths favored this kind of error very much. 

In some calls, an abnormally high number of errors fell on those bit 
positions which corresponded to single marks between spaces; in others, 
an abnormally high number fell on the positions corresponding to marks 
immediately preceding spaces. These effects may have been caused by 
conditions on the line, such as high delay distortion; or they may have 
been caused by characteristics of the test equipment, as illustrated in 
Section IX. 


IX. THE CASE OF CALL NO. 2390 


Call No. 2890 was a 30-minute Long Haul Call at 1200 bits/second 
with 4565 errors. This call had more errors than any other call in the 
test program. In spite of the large number of errors, the three methods 
described did not reveal the bit phase of the word generator in this call. 
Therefore, the pattern of errors in the call was subjected to close scru- 
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tiny. It was discovered that the sequence of good bits and bits in error 
at the end of the call was precisely that which would be recorded if the 
receiving word generator were one bit out of step with the transmitting 
word generator. Somewhat further back in the call the pattern corre- 
sponded to the two generators being two bits out of step. This was traced 
back through the call step by step until finally a place was reached where 
the two word generators were out of step by fifteen bits. By this time, 
several thousand errors had been accounted for. The search was not 
carried beyond this point. 

Kividently the bulk of the errors in Call No. 2390 were caused by a 
failure of the terminal equipment to keep in step. These 4565 errors make 
up 12 per cent of the errors observed during the entire testing program 
and 17 per cent of those observed in Long Haul Calls. 

It does not seem justified to let these pseudo-errors contribute to the 
reported error rate on the telephone network. And it is certain that this 
call should not be used in a study of error control or of the burst struc- 
ture of errors, since not only the pattern of errors but the errors them- 
selves were caused by the nature of the test equipment and not by the 
characteristics of the telephone plant. 


X. SOME SPECIAL RESULTS 


During Call No. 1568, a Long Haul Call at 1200 bits/second, the field 
test personnel noted that four bursts of errors which caused a total of 
about 400 errors coincided with audible multifrequency keypulses. A 
direct inspection of these errors failed to reveal much regularity in the 
pattern of errors within the bursts. After the bit phase of the word gen- 
erator in this call was determined, the sequence of received marks and 
spaces was inspected. During the four keypulses, the received signal 
conformed to the following four patterns, respectively: 


MMMS SSMMMSSSMMMSSS 
MMMMMSMMMMMSMMMMMS 
MMS MMSMMS MMSMMS MMS 
MMMMSSMMMMSSMMMMSS 


with occasional errors superimposed, presumably arising from noise. These 
patterns have periods of 6, 6, 3, and 6 bits, respectively, which corre- 
spond to frequencies of 200, 200, 400, and 200 cps. These frequencies 
may be related to the fact that the two tones transmitted simultaneously 
during an MF pulse are always separated by a multiple of 200 cps. 

In parts of some calls, remarkably many errors were separated by 
exactly 57 good bits. On investigation, these all turned out to be calls 
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at 1200 bits/second which terminated in a few particular step-by-step 
central offices. Most probably, these errors were caused by some switch- 
ing mechanism in the telephone plant which, when it operates, produces 
a 21-cps disturbance. 

In Call No. 1420, a Long Haul Call at 1200 bits/second, there were 
four long dropouts and their lengths were 301, 302, 329, and 337 bits. 
The closeness of lengths of these four dropouts suggests that all of them 
had the same cause. 

In Call No. 2429, a Long Haul Call at 1200 bits/second, there were 
eight long dropouts and their lengths were 33, 33, 67, 67, 94, 95, 97, and 
126 bits. All of these lengths are close to being multiples of 32 bits, again 
suggesting a common source of the dropouts. 


X1. THE ROLE OF THE TEST WORD 


The choice of a cyclic word generator as the source of the transmitted 
data did not affect the measurement of error rate significantly. However, 
its characteristics were strongly reflected in the final error recordings be- 
cause of the considerable contribution of dropouts to the total error 
rate. The fact that the length of the test word was 30, a number with 
many small factors, again did not affect the measurement of error rate; 
but if the data were used in simulation experiments — for example, to 
test an error-detecting code whose length was a divisor of the number 
30 — the results could be biased by this choice of length of test word. 

After the bit phase of many of the calls was revealed by analysis of 
the error recordings, it was possible to resolve the following questions: 

(1) whether any particular error changed a mark into a space or the 

reverse, 

(2) which, if any, of the 30 bit-positions were most susceptible to 

errors, 

(3) to what extent errors were caused by dropouts, and 

(4) the contribution of terminal-equipment malfunction to the ob- 

served error rate. 

Using the bit phase to study the distribution of errors among the 30 
bit-positions of the test word, it was discovered that the most vulner- 
able positions were different in different calls. It would be instructive 
to be able to compare these differences in vulnerability with the meas- 
ured characteristics of the transmission medium. There were a number of 
calls in the series for which almost all of the errors fell on a single bit- 
position in the test word. It is not known either why this occurred or 
which bit position was affected, since it is impossible to determine the 
bit phase for such calls. 
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XII. RELATION TO ERROR-CONTROL SCHEMES 


Once of the results of this study is the knowledge that with the modu- 
lation scheme used in the testing program, the mark-to-space and space- 
to-mark errors were not evenly mixed with each other. lor a long period 
of time the mark-to-space variety was more abundant by a large factor, 
owing to noise; at other times the reverse was true, because of dropouts, 
interfering tones, and the like. An error-detecting code which counts 
the number of marks per block is much more efficient in this situation 
than when mark-to-space errors and space-to-mark errors are well mixed. 

Tor example, in the 2/8 code, which has been suggested for error de- 
tection, bits are transmitted in blocks of eight, and in every block there 
are exactly two marks and six spaces. A received block with more or less 
than two marks is detected as an error. This scheme will always detect 
an error condition if a single error occurs in a block. If a block contains 
two errors (which is likely because of the burst character of the errors) 
then an error condition will be detected if both errors are in the same 
direction but not if they are in opposite directions. Assuming that the 
two kinds of error are well mixed and that sent marks are four times as 
likely to be in error as sent spaces, we find that a double error in a block 
will be detected only 39 per cent of the time. The truth is that, because 
of the lack of mixing, a double error in a block would be detected over 90 
per cent of the time by this code. During a dropout the situation is even 
more striking. A message using this code would experience a bit-error 
rate of 75 per cent during a dropout but every one of these errors would 
be detected. The relative advantage of this code and of other codes of 
this type would go unnoticed if the codes were evaluated by a direct 
simulation using the field test data. 

A useful technique in studying error-control methods is the construc- 
tion of mathematical models of the occurrence of errors. Several such 
models have been constructed using the field test data,?:? and have been 
used to study the effectiveness of error-control procedures.* To test the 
validity of such models and to evaluate the parameters used in the 
models, it is necessary to study the manner in which errors are distributed 
in time, e.g., to what extent they occur in bursts. One way of doing this 
is to determine the distribution of lengths of runs of consecutive errors 
and the distribution of lengths of runs of good bits between errors.” Be- 
cause of the structure of the test generator word, every run of consecu- 
tive errors in the field test data during a dropout had length 1, 2, or 4 
and every run of consecutive good bits during a dropout had length 1, 
2, or 4. These distributions of run lengths depended only on the choice 
of test-generator word. Also, repetitive patterns of errors in Long Haul 
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Calls were often produced by causes other than dropouts. For instance, 
the single event described in Section [X contributed 17 per cent of all 
the errors in these calls, and occasions (described in Section X) when the 
signal was overridden by tones contributed about another 10 per cent. 
Thus, including dropouts, about half the errors in Long Haul Calls oc- 
curred in repetitive patterns, and these patterns reflected the nature of 
the test equipment and the modulation scheme rather than that of the 
telephone plant. 

Also, when short spikes of impulse noise caused bits to be received as 
spaces regardless of what was transmitted, errors were recorded in the 
field test data only when the occurrence of a spike of noise coincided 
with a transmitted mark. Those which coincided with transmitted spaces 
were not recorded as errors. Therefore, the mean length of runs of good 
bits in the data is about twice the mean distance between spikes of im- 
pulse noise. If the mark-counting code described above were tested 
against these data, it would appear to experience twice as many errors 
as it would have experienced if it had been used on a real-life channel. 


XITI. CONCLUSIONS 


With no knowledge of the bit phase of the word generator, the only 
error statistics which could be reliably deduced from the field test data 
were the error rates themselves. With the recovery of the bit phase, the 
proportion of mark-to-space errors could be determined, dropouts were 
revealed, and certain malfunctions of the test equipment were discovered. 
Now a considerable amount of additional information is available about 
the causes and nature of errors experienced by data in the switched 
telephone network. This information makes the existing data more use- 
ful for simulation and analysis. 
The analysis reported in this paper confirms some results of Ref. 1 
and modifies some others. It is unrelated to most of the material dis- 
cussed in Ref. 1. Specifically: 
(1) The subject matter of this paper has no bearing on I’igures 1—25 
of Ref. 1. 

(2) The reported error rates and the distributions of error rates by 
classes of calls are essentially as indicated in Figures 26-29 of 
Ref. 1. 

(3) The time distribution of errors over large blocks (e.g., more than 
50-100 bits), as shown in the right-hand portions of Figures 30, 
31, 32, 34, 36, and 38 of Ref. 1, is not changed. 

The validity of the time distribution of errors over very short 
blocks, (e.g., 2-10 bits), as represented in [Figures 33, 35, 37, 39 


(4 


eed 
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and the left-hand portions of Figures 30, 31, 32, 34, 36, and 38 of 
Ref. 1, has not been confirmed. 
(5) The problem mentioned in (4) throws doubt on the data used 
to construct Figures 40-43 of Ref. 1. 
The results whose validity is questioned in (4) and (5) follow from ar- 
bitrary choices made in designing the field tests. The effects of such 
choices are considerable and hard to evaluate. It is therefore difficult 
to assess the nature and extent of the changes that should be made in 
the figures named in (4) and (5). 

Because line dropouts (and the loss of word-generator synchronization 
in one call) contributed significantly to the error rate, the choice of the 
test word influenced the time distribution of errors, as noted in (4) 
above. This must be considered in evaluating schemes for error correc- 
tion and detection. 

In most of the calls with high error rates not caused by dropouts, the 
errors had a distinct tendency to change marks into spaces. This also 
must be considered in evaluating error-control schemes. 

It must also be borne in mind that the property of receiving all bits 
as marks in the absence of signal and the property that impulse noise 
usually changes marks into spaces are not themselves characteristics of 
the telephone plant, but are dependent on the particular devices used 
for modulation and demodulation (Ref. 1, pp. 485-6). The important 
point is that the methods and point of view of this paper can be applied 
to data obtained with any system of modulation. One cannot hope to 
devise a good method of controlling errors without a deep knowledge 
of the nature of the errors and of how the terminal equipment is affected 
by them. 
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Grade of Service of Direct Traffic Mixed 
with Store-and-Forward Traffic* 


By JOSEPH OTTERMANT 
(Manuscript received November 28, 1961) 


The dual use of trunks for both direct and store-and-forward (S/F) traffic 
makes high trunk efficiency possible. The resultant trunk savings are im- 
portant in communication systems in which long-haul trunks contribute 
heavily to the cost of the system. This paper reports work on the computa- 
tion of trunking tables that could be used to engineer trunk requirements 
for prescribed loads and distributions of direct and S/F traffic. 

A method of computation and some specific results in terms of grade of 
service of direct traffic and traffic capacity for S/F traffic are presented. 
The numerical results are for two to forty-eight trunks. The results apply 
to the case of exponentially-distributed holding times of both the direct and 
the S/F traffic. 


I. INTRODUCTION 


Direct traffic is a user-to-user service (generally voice) requiring a 
connection to be established promptly on demand. Store-and-forward 
(S/I*) traffic, on the other hand, is stored at or near the originator’s 
location and is later sent to its destination, either directly or through 
further intermediate storage. If S/I* traffic is sent only when the direct 
traffic load is light, large amounts of S/F traffic can be accommodated 
with very slight degradation of direct traffic service. This can be es- 
pecially important in a long-haul communication system where the 
trunk group cross sections (number of trunks in a group) are small. 
Such trunk groups are notoriously inefficient if used only for direct 
traffic, but they have a substantial ability to handle additional S/F 
traffic. 


* This work has been carried out under U. 8. Army Signal Corps Contract 
DA-36-039-SC-78806. 

+ I.T.T Federal Laboratories, Nutley, N. J., (work performed for Bell Tele- 
phone Laboratories). Present affiliation, General Electric Company. 
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The basic operating method analyzed in this paper is as follows. The 
occupancy of the trunks in a trunk group is monitored, and whenever 
the occupancy drops below a certain level, the S/F traffic is allowed 
access to the idle trunks. The sending of an S/I* message is not inter- 
rupted in order to service arrivals of direct traffic, but when transmis- 
sion of an S/F message is begun, a specified number of trunks is always 
held in reserve for direct traffic. Under the foregoing rules, the present 
analysis establishes (7) the grade of service of direct traffic and (7) the 
amount of S/F traffic that can be accommodated. Direct traffic conges- 
tion discipline is assumed to be governed by the lost-calls-cleared assump- 
tion,* which means that direct calls which encounter an all-trunks busy 
condition do not reappear in the busy hour. 

The statistical nature of traffic is first summarized. The problem of 
dual use of trunks is formulated as a probabilistic net representing a 
Markov chain. It is assumed that a queue of S/F traffic exists at all 
times. Using cut-sets in the state graph, this probabilistic net is ana- 
lyzed to derive expressions for the steady-state probabilities of trunk 
occupancy. I‘rom the expressions of trunk occupancy for a given load 
of direct traffic, the grade of service of the direct traffic and the amount 
of the S/F traffic that can be accommodated are determined. A glossary 
of symbols is given in the appendix. 


I]. MATHEMATICAL ANALYSIS 


2.1 Statistical Properties of Traffic 


The direct calls are assumed to be generated individually and col- 
lectively at random. The expected number of arrivals per hour is denoted 
as n. The expected number of arrivals during a fraction of an hour dr 
is simply given by n dr. When dr is short enough so that multiple events 
are improbable, the probability of an arrival during dr is equal to the 
expected number of arrivals: 1.e., equal to n dr. 

In the following discussion it will be more convenient to measure 
time in fractions, dt of the average holding time of the direct traffic, 
which is denoted 7’. Then the probability of an arrival during a short 
time, di, is therefore: 


Probability of an arrival during dt = nT dt = a dt. (1) 


* The assumption of lost-calls-cleared is especially appropriate when an alter- 
nate route is provided for calls that encounter the all-trunks-busy condition. 
When an alternate route is provided, the probability of loss should be interpreted 
as the portion of calls that overflow, seeking the alternate route. 
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The product n7’, which is denoted a, denotes the offered direct traffic 
measured in erlangs. It represents the expected number of calls in prog- 
ress on a fully served basis. Equation (1) indicates still another signif- 
icance of a: it is the expected number of arrivals during one holding 
time. 

The holding times of both the direct and the S/F traffic are assumed 
to be exponentially distributed (with average T for the direct and ty 
for 8/I* traffic). Under the assumption of exponential holding times, 
the probability of termination of a call with average holding time 7 
during a differential interval dz is given by dr/T plus terms negligible 
in the limit dr — 0. In fractions dt of the holding time 7’, this probability 
is simply dt. When « calls are in progress, the probability of the termina- 
tion on one of these calls during a differential interval dt is given by: 


Probability of a termination out of x calls = x dt. (2) 


Similarly, the probability of a termination out of z S/F messages in 
transmission is rz dt, where r is the ratio T/t . 

The probability of a termination during dt depends on the present 
status of the trunk group: ie., the number of calls in progress. Neither 
the probability of an arrival (1), nor the probability of a termination 
(2), depends upon the past history of the calls. This demonstrates the 
fact that under the above conditions the trunk occupancy is a Markov 
process. 

The assumption of lost-calls-cleared is used; that is, a direct call that 
encounters a condition of all-trunks-busy is cleared from that trunk 
group and does not reappear in the busy hour. In the absence of 8/F 
traffic, the trunk occupancy is a relatively simple Markov process, which 
is shown in the flow diagram in Fig. 1. Statistical equilibrium considera- 
tions lead to the following formula for the probability G, of exactly x 
trunks being busy: 

aq” 
a! 


C=. (3) 





Bé@)- = 4. (4) 
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Fig. 1 — Markov chain for trunk occupancy under lost-calls-cleared assumption, no S/F traffic. 
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2.2 Trunk Occupancy by Direct and S/F Traffic on a Lost-Calls-Cleared 
Basis 


2.2.1 One Trunk in Reserve, s = 1 


The queue of S/F traffic is given access to the trunk group only when 
the occupancy is such that one trunk will remain idle after transmission 
of an S/I* message is initiated. An S/F queue is assumed to exist at all 
times, which means that whenever two trunks in the trunk group be- 
come idle, one of them is taken for S/F transmission. When the reserve 
trunk becomes busy with direct traffic, S/F transmission is stopped on 
the trunk on which the next (in time sequence) termination of an S/T 
message occurs. This last-mentioned trunk then becomes a reserve 
trunk. 

The occupancy of trunks can be regarded as a Markov process, which 
is shown in the flow diagram of Fig. 2. The circles in the bottom row 
represent states in which one trunk is available to accommodate possible 
arrivals of direct traffic. Conversely, the circles in the top row represent 
all-trunks-busy conditions. Trunk occupancy by §/F traffic is repre- 
sented by the index z, which, in Fig. 2, increases from left to right. S, 
represents the steady-state probability of z trunks busy with 8/F traffic 
(in the bottom row): i.e., (¢ — 2 — 1) trunks busy with direct traffic 
and one trunk in reserve. R, represents the steady-state probability of 
z trunks busy with §/F traffic (in the top row): i.e., (ce — z) trunks busy 
with direct traffic and no trunks idle. 

In the steady state, the transition probabilities out of S, during dt 
are: a dt Sz through a new arrival (transition into R,) and (¢ — z — 1) 
dt S, through a termination of one of (¢ — z — 1) direct calls in progress 
(into S,4; since, as soon as two trunks are recognized as idle, one of 
them is seized for S/F transmission). 

Transition probabilities into S, are: (c — z)dtR, through a termina- 
tion of one of (¢ — z) direct calls in progress in state R, (in the time 
interval after an arrival of a direct call and before the next termination, 
in time, of an S/I’ message); (2 + 1)r dtR.4, through a termination of 
one of the (z + 1) S/F messages in progress in state R.4, ; and (c — z) 
dt S,_, through a termination of one of the (¢ — z) direct calls in prog- 
ress in state S,1. Equating the transition probabilities out of S, to 
transition probabilities into S, results in the following equation: 


(ate—z-1)8.= (ec -—2z)S.44+ (e+ 1)r Rea t+ (ec —2z)Rz. (5) 
The transition probabilities out of R, during dé are: (c — 2) dt R, 
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Fig. 2 — Markov chain for trunk-occupancy under lost-calls-cleared assumption, dual use of trunks, one trunk in reserve. 
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through a termination of one of (¢ — z) direct calls in progress (into 
S.) and zr dt R. through a termination of one of z S/I* messages in prog- 
ress (into S,_,). The transition probability into R, is adt S, through an 
arrival of a direct call. 

It should be pointed out here that if a new, direct arrival occurs when 
the chain is in state R,, one call is lost (or takes another route), but 
transition to another state does not occur under the assumption of lost- 
calls-cleared. 

Equating the transition probabilities into 2. to transition probabil- 
ities out of R, results in the following equation: 


aS. = (ec —z+ar)R. = [e + e(r — 1))R.. (6) 
Subtracting (6) from (5) results -n 
(2+ 1)rRiyy = (Cc —2—1)8. —(e —z)Sz1+earR.. (7) 
This relation is satisfied if the following holds: 
(2+ \rRig = (e —2—1)8, (8) 
and therefore, 
arR, = (¢ — 2)Sz4. (9) 


Actually, the recurrence relation of (8) and (9) can be observed directly 
by equating the transition probabilities through the appropriate cut-set 
in Tig. 2. 

We will express first R, and S,,0 S$ z S (ce — 1), in terms of Ry, 
the probability that all trunks are busy with direct traffic. Combining 
(6) and (8) shows that 


ar(z + 1)Rey = (C —2 — 1) [e+ 2(r — IIR. (10) 
and therefore, 
areR, = (ec — z)le + (2 — 1)(r7 — 1) JR. (11) 


Iteration of (11) results in the following expression for R, : 


[c + O(r — 1)][e + 1(r — 1)]le + 2(r — 1)] 
--+ [e+ (2 —1)(r — 1)] 
a! re (12) 


feed Ore ae (¢ — 2) p 
a? 


Rk, = 





From (12) and (6), 
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[e + O(r — 1)]le + 1(r — 1)lle + 2(r — 1)] 
Se ee 
° zl r (13) 





NOS 1) Ce 2) oes (¢ — 2) p 


q?tl 


Now, Rp can be determined for a given load a by noting that the sum 
of probabilities R, and S, adds up to unity: 


c—l 


> k, +28, = 1. (14) 


Introducing R, and S, from (12) and (13), respectively, 


le + O(r — 1)]le + 1(r — 1)]le + 2(r — 1)] 


1_S -[e + (2 —1)(r¥ — 1)] 
Ry 2a 2! r 
He Te 92) ass (ez) 
a” 
[c + O(r — 1) lle + 1(r — 1) le ea i 1)] ' 
<I -+» [eo + 2(r — 1)] 
+2 rae id 
Ae 1) (O82) ore 2) C15) 
qth 


= fatet+e(r—1)} 
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ll 
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Jes Ot Wile Lae) se le oe Ire TI 


z! r? 





Kee lea 2) ee (Cog) 
qzti : 


The probability of loss corresponds to the probability that all trunks 
are busy: i.e., the sum of the probabilities R, . This can be referred to as 
the dual-use-of-trunks formula, D(c,a,r,1) for ¢ trunks, a erlangs of 
offered direct load, ratio r of holding times, and one trunk in reserve. 
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D(c,a,rj1) = > P. 
_le + O(r — 1)]le + ue —1)] 
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The amount of S/F traffic in erlangs that can be accommodated is 
denoted b. It is given by 


c—1 c—1 
b = >> 28, + do ek.. (17) 
z=0 z=0 


Equations (16) and (17) constitute the results sought. 

The load b can be determined by a simpler formula when the grade 
of service D(c,a,r,1) has been calculated. This formula can be arrived 
at by the following reasoning. The load in terms of trunks occupied is 
c — 1 at all times, plus one additional trunk with probability D(c,a,r,1). 
The erlang load in terms of traffic carried is b for the 8/F traffic and 
all — D(c,a,r,1)| for the direct traffic. [The portion aD(c,a,r,1) is 
cleared under the assumption of lost-calls-cleared.| Equating the two 
results yields the following equation, 


b + afl — D(c,a,r,1)] = ¢ — 1 + D(ce,a,r,1) (18) 
and therefore 
b=c—1-—all — D(c,a,r,1)| + D(c,a,r,1) 
(19) 
=c— (a+ 1)[l — D(e,a,r,1)]. 


It should be pointed out that for r — © the probabilities R, — 0 
as 1/r (except for Ry). In this limiting case the probability of loss tends 
to B(c,a) with the difference—i.e., the impairment due to the S/F traf- 
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fie—being of the order of 1/r. This can be written 


lim [D(c,a,r,1) — B(c,a)] = 0 (*). (20) 
The probabilities Ry, Sy, Sy ,°'* S--1 tend to become, respectively, 
Ge, Ger, Ges tre ig? Go. 


2.2.2 Two Trunks in Reserve, s = 2 


In this case, the queue of S/F traffic is denied access to the trunk 
group unless two trunks will remain idle after transmission of an 8/I 
message is initiated. A queue of S/T’ traffic is again assumed to exist at 
all times, which means that as soon as three trunks in the trunk group 
become idle, one of them is taken for transmission of an S/I° message. 
When one (or two) of the reserve trunks becomes busy with the direct 
traffic, S/F transmission is stopped on the trunk on which the next (in 
time sequence) termination of an S/F message occurs. The flow diagram 
of the process is shown in Fig. 3. In states S, two trunks are idle; in 
states W, one trunk is idle; in states R, all trunks ase busy. Thus, in 
state S,, z trunks are busy with the 8/F traffic and (c ~— z — 2) with 
the direct traffic; in state W, , z trunks are busy with the S/F traffic and 
(c — zg — 1) with the direct traffic; in state R, , z trunks are busy with 
the 8/I traffic and (c — z) with the direct traffic. The transition prob- 
ability coefficients are as given in the I'ig. 3. 

Three basic equations are presented now, which state that the net 
transition flow through three cut-sets is zero under the steady-state con- 
ditions. The three cut-sets are indicated in lig. 3 by lines of dots. 


aS. + (2+ l)rRig = (Cc —2 —-1)W.+ (e —2z—1)S8.4 (21) 
(ec —z—1)S.4 = erR. + erW, (22) 
(c—z+er)Rz = [c+ 2(r — 1)]Rz = aW,. (23) 
On the basis of (23), (22) can be rewritten as follows: 
(ec—~z2—1)8,4= ar +o UTR 


a 
= E Te tate =| ie 


2 


(24) 


and similarly, 
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Cr eee eae E + ct EFDG— DY) oe 
(25) 


Wiy- 





nen 
c+ (2+1)(r-1) 


Introducing the relation from (25) into the left side of (21) and the 
relation from (24) into the right side of (21), we obtain 


c—2z2-—2 
gli $in et? —_|6, 


= | ne (1 + +=) W. (26) 


= E ha 1) 1+ aes =| W.. 


= @+Ur[ 1+ 





We assume / is known and proceed to derive the other probabilities in 
the following manner: W, is determined from R, through the use of (23), 
which is rewritten in a convenient form as (27). S, is determined from 
W, through the use of (26), which is rewritten in a convenient form as 
(28). Rz4, is determined from S, through the use of (25), which is 
rewritten in a convenient form as (29): 





Wee —— R. (27) 
zra 
R cee oare arcane (28) 
° afi+ e—z-2 | : 
atect+(z2+1)(r -—1) 
= (c—z—-2)a 
me GfliererePDE=DI wy 
We obtain, using (27), for z = 0, 
W, = CtOr = DIE (30) 


a 


Using (28), for z = 0, 
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Using (29), for z = 0, 
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Using (27), for z = 1, 
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Using (28), for z = 1 
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Jeter — 1-14 
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o(kK + 1)rle+ (K+ 1)(r - oi 
Now fy can be determined, since 


c—2 


c—2 c—2 
> Rk. + DW. + DS, = 1. (38) 
2z=0 2z=0 é=0 


The probability of loss D(c,a,r,2) is equal to the probability that all 
trunks are busy 


c—2 


D(ca,r,2) = > R:. (39) 
z=0 


When values for R, obtainable from (35), (36), (87), and (38) are used 
in (89), it can be regarded as the dual-use-of-trunks formula D(c,a,r,2) 
for c trunks, a erlangs of offered direct load, ratio of holding times r, and 
two trunks in reserve. 
The amount b of the S/I* traffic in erlangs that can be accommodated 
is given by 
c—2 


b = Dek, + 2 eW. +2 a8,. (40) 


It should be pointed out that for r— «, the probabilities R, — 0 as 
1/r’ (except for Ry) and the probabilities W. — 0 as 1/r (except for 
W,). In this limiting case, the probability of loss tends to B(c,a) with 
the difference —i.e., the impairment due to S/F traffic — being of the 
order 1/7”: 


hn (card) Beare o (4). (41) 


r2 


The probabilities Ry , Wo, So, Si, +++ Sc-2 tend to become, respectively, 
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Ge, Goat, Geo , Ge_3 +++ Go. The load b becomes 
b=c —2— all — D(c,a,r,2)] + 2G. + Gor. (42) 


This can be seen by the following reasoning. The load in terms of trunks 
occupied is (c — 2) trunks at all times, one additional trunk with the 
probability G._; and two additional trunks with the probability G,. 
The erlang load in terms of traffic carried is b for the S/F and afl — 
D(c,a,r,2)| for the direct traffic. Equating the two results yields (42). 


III. RESULTS AND CONCLUSIONS 


The recurrence relations for trunk occupancy probabilities as derived 
above were programmed on an IBM 704 computer by Miss B. Berman. 
Two types of programs were written: 

1. The grade of service of direct traffic and the amount of S/F traffic 
accommodated are calculated for a given amount a of direct traffic, a 
given number of trunks in a trunk group, and either one or two trunks 
in reserve. 

aw. The amount a of direct traffic and the amount of S/F traffic are 
calculated for a given grade of service assigned to direct traffic, a given 
number of trunks in a trunk group, and either one or two trunks in re- 
serve. This computer program is similar to (2) except that it involves 
successive approximations in the amount of direct traffic to obtain the 
desired grade of service. 

These computer programs were used for trunk groups ranging in size 
from 2 to 48 trunks. Special attention was given to one ratio of the aver- 
age holding time of direct traffic to the average holding time of S/I 
traffic, r = T/to = 300/3.6 = 83.3. This ratio was selected to correspond 
to the average holding times which were originally expected in the 
UNICOM system, for which the study was initially done. For the direct 
trafic, T = 5 minutes = 300 seconds, and for the S/F traffic, to = 3.6 
seconds. This last figure was obtained by assuming 150 words for the 
average message length, 42-bits-per-word digital coding, and 2400-bits- 
per-second transmission speed. Approximately one second is allowed in 
the total for the average time required to establish the connection. Thus, 
on the average, for one fully loaded trunk, 1000 S/I° messages per hour 
is the equivalent of only 12 direct calls. 

Calculations in the first type of program above were made for offered 
loads of direct traffic determined by the formula B(c,a) = 0.05. The 
resulting grade of service of direct traffic and the amount of S/F traffic 
carried for r = 83.8 are presented in Table I. (The grade of service is 
plotted as a solid line in Figs. 4 and 5.) It can be seen that substantial 


TaB.LE I — D(c,a, 83.3,s) AND b For B(c,a) = 0.05 


























One Trunk in Reserve Two Trunks in Reserve 
c a 
D(c,@,83.3,1) b D(c,a,83.3,2) b 
2 0.38132 0.05255 | 0.69127 = = 
3 0.89940 0.05477 1.20463 0.05003 0.41676 
4 1.52462 0.05670 1.61853 0.05008 0.79092 
5 2.21848 0.05842 1.96954 0.05014 1.11664 
6 2.96033 0.05996 2.27714 0.05021 1.40371 
7 3.73782 0.06137 2.55294 0.05028 1.66049 
8 4.54297 0.06267 2.80439 0.05036 1.89308 
9 5.37025 0.06387 3.03662 0.05044 2.10599 
10 6.21572 0.06500 3.25327 0.05052 2.30257 
12 7.95007 0.06704 3.64998 0.05069 2.65643 
16 11.54361 0.07053 4.34111 0.05105 3.25047 
20 15.24928 0.07344 4.94398 0.05140 3.74232 
24 19 .03073 0.07592 5.49001 0.05176 4.16546 
30 24 .80184 0.07908 6.23847 0.05229 4.71144 
36 30.65736 0.08173 6.92986 0.05279 5.18284 
48 42.53693 0.08598 8.20641 0.05374 5.98058 

















IMPAIRMENT IN GRADE OF SERVICE [D(c,a,1,1)-B(c,a) | 

















6 12 18 24 30 36 42 48 54 
NUMBER OF TRUNKS IN GROUP (C) 


Fig. 4 — Grade of service D(c,a,r,1) with a given by B(c,a) = 0.05, vs number 
of trunks c. 
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Fig. 5 — Grade of service D(c,a,7r,2) with a given by B(c,a) = 0.05, vs number 
of trunks c. 


amounts of S/F traffic can be carried. When one trunk is held in reserve, 
the amount ranges from 0.7 erlang (700 messages per hour) in the case 
of a trunk group with 2 trunks, to 8.2 erlangs (8200 messages per hour) 
in the case of a trunk group with 48 trunks. When two trunks are held 
in reserve, the amount ranges from 0.42 erlang (420 messages per hour) 
in the case of a trunk group with 3 trunks, to 6.0 erlangs (6000 messages 
per hour) in the case of a trunk group with 48 trunks. 

The impairment in grade of service to direct traffic — 1.e., the differ- 
ence between the grade of service under the dual use of trunks and the 
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grade of service when no §/F traffic is sent (for the same amount of 
direct. traffic) — is rather small when 1 trunk is held in reserve and 
practically insignificant when 2 trunks are held in reserve. The impair- 
ment increases from 0.003 for the 2-trunk group to 0.036 for the 48-trunk 
group when J trunk is held in reserve, and from 0.00003 for the 3-trunk 
group to 0.0037 for the 48-trunk group when 2 trunks are held in reserve. 

Similar calculations were carried out for different ratios r of holding 
times, ranging from 100 to 1. The grade of service to the direct traffic 
is plotted, with r as a parameter, as dashed lines in l’igs. 4 and 5. The 
impairment increases rapidly with decreasing r. If the impairment were 
plotted as a function of r, it can be seen that for r > 10 the impairment 
obeys closely the asymptotic functional relation 1/r of (20) for s = 1 
and 1/r? of (41) for s = 2. For lower r, the impairment increases more 
slowly than 1/r for s = 1, or 1/r? for s = 2. The capacity b for carrying 
S/l* traffic increases slowly with the increase of probability of loss, 
which is indicated for s = 1 by (19). 

Calculations in the second type of program were used to obtain the 
offered amount of direct traffic, which will result in grade of service 
0.05 under the dual use of trunks: i.e., load a in erlangs determined by 
D(c,a,83.3,1) = 0.05 and by D(c,a,83.3,2) = 0.05. The amount b of 
S/I* traffic was computed at the same time. These results are presented 
in Table II and lig. 6. Comparing dual use of trunks having | trunk in 
reserve with use of trunks for direct traffic only at the same grade of 
service, it can be seen that for the 2-trunk group, 0.7 erlangs of S/F 


TABLE IJ —a Anp b For D(c,a,83.3,s) = 0.05 




















One Trunk in Reserve Two Trunks in Reserve 
D(c,a,83.3,1) = 0.05 D(c,a,83.3,2) = 0.05 
c 
a 6 a b 
2 0.36953 0.69894 — —_ 
3 0.85954 1.23344 0.89912 0.41686 
4 1.44427 1.67795 1.52364 0.791438 
5 2.08803 2.06637 2.21629 1.11790 
6 2.77203 2.41657 2.95642 1.40608 
7 3.48519 2.73907 3.73167 1.66431 
8 4.22041 3.04060 4.53409 1.89868 
9 4.97292 3.32572 5.35812 2.11369 
10 5.73925 3.59771 6.19983 2.31268 
12 7.30392 4.11127 7.92525 2.67221 
16 10.52058 5.05545 11.49555 3.28063 
20 13 .80962 5.93086 15.17145 3.79025 
24 17.14272 6.76441 18.91730 4.23396 
30 22,.19328 7.96639 24 .62530 4.81501 
36 27 .28129 9.13277 30.40758 5.32546 
48 37.51678 11.40906 42.11484 6.21013 
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* a DETERMINED BY 
D(c,a,83.3,1)=0.05 





b DETERMINED BY 
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D(c,a, 83.3, 1)=0.05 























DIRECT LOAD OFFERED IN ERLANGS(a) 
S/F LOAD ACCOMODATED IN ERLANGS (b) 











NUMBER OF TRUNKS IN GROUP (C) 


Fig. 6 — Load of S/F traffic b and offered load of direct traffic a defined by 
D(c,a,83.3,1) = 0.05 and by D(c,a,83.3,2) = 0.05, vs number of trunks c. 


traffic (700 messages per hour) can be sent at the cost of decreasing the 
offered direct load from 0.381 erlang to 0.369 erlang: i.e. by 0.012 erlang 
(a fraction 0.14 of the average holding time of a direct call). For the 
48-trunk group, 11.4 erlangs of S/F traffic (11,400 messages per hour) 
can be sent at the cost of decreasing the offered direct load from 42.5 
erlangs to 37.5 erlangs: i.e. by 5.0 erlangs (60 direct calls). Making a 
similar comparison when two trunks are held in reserve, it will be noted 
that if the offered load a determined by B(c,a) = 0.05 were to be plotted 
in Fig. 6, it would virtually coincide with the plotted offered load a 
determined by D(c,a,83.3,2) = 0.05. Thus, the decrease in direct capacity 
is negligible. In Table III an example is presented of trunk occupancy 
probabilities which result from a calculation in the second type of pro- 
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TaBLE III — Trunk Occupancy PROBABILITIES 





























One Trunk in Reserve Two Trunks in Reserve 
D(5,a,83.3,1) = 0.05 D(5,a,83.3,2) = 0.05 
z Zz 
R; Sz Rz W, Sz 

0 0.03912 0.09368 0 0.04977 0.11229 0.19610 
1 0.00450 | 0.18807 1 0.00018 | 0.00688 | 0.27160 
2 0.00338 | 0.27508 2 0.00004 | 0.00322 | 0.24702 
3 0.00220 | 0.26559 3 0.00001 0.00098 0.11190 
4 0.00080 | 0.12758 

Total 0.05000 0.95000 | 0.05000 | 0.12337 0.82663 





gram; probabilities 2, and S, for D(5,a,83.3,1) and the probabilities 
R,, W., and S, for D(5,a,83.3,2) are given. 

These calculations were repeated for different values of r ranging 
from 100 to 1. The capacity a for carrying direct traffic determined by 
D(c,a,r,1) = 0.05 is plotted in Fig. 7 and by D(c,a,r,2) = 0.05 in Fig. 8. 

The following conclusions can be drawn from the numerical results. 
The dual use of long-haul trunks for both direct and 8/F traffic can be 
economically attractive. The calculations indicate that when one or two 
trunks are held in reserve for possible arrivals of direct traffic, it is pos- 
sible to send large amounts of S/F traffic on the same trunk groups as 
direct traffic with little or negligible impairment of the direct traffic, 
provided the parameters of group size and the number of trunks held 
in reserve are properly selected. The dual use of trunks is especially 
attractive if the average holding time of the S/F traffic is short compared 
with the average holding time of the direct traffic. When the holding 
time of the S/I* traffic approaches the average holding time of the direct 
traffic, the capacity for carrying direct traffic deteriorates sharply, unless 
two trunks (or more, for large trunk-groups) are held in reserve. 

The potential economies are believed to be of greatest importance in 
systems with small trunk groups. A manipulation of the data (1 trunk 
in reserve) will serve to point out that the gain in trunk group efficiency 
is very substantial in the smaller groups and decreases as the number 
of trunks increases. The trunk efficiency is the load carried divided by 
the number of trunks. The load carried is [1 — B(c,a)Ja = 0.95a (from 
Table I) when no §/F traffic is present, and 


[1 — D(c,a,r,1)la + b = 0.95a + b 


(from Table IT) under the dual use of trunks. The 2-trunk group increases 
in efficiency from 18 to about 52 per cent, the 10-trunk group from 59 
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Fig. 7 — Offered load of direct traffic a defined by D(c,a,r,1) = 0.05, vs ratio 
of holding times r. 


to about 90 per cent, and the 48-trunk group from 84 to about 98 per 
cent. These advances are significant in terms of possible savings in long- 
haul transmission plant. 

This method of operating circuit groups at such a high level of occu- 
pancy remains to be evaluated in terms of the grade of service of the 
S/F user. The grade of service of the S/F traffic has not been discussed 
here. : 

The analysis given here was confined to a single trunk group carrying 
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Fig. 8 — Offered load of direct traffic a defined by D(c,a,r,2) = 0.05, vs ratio 
of holding times r. 


traffic between two points. The problems of alternate routing and 
through-switching of traffic are yet to be explored. 
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Glossary of Symbols 


a 


B(c,a) 


D(c,a,r,8) 
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Offered amount of direct traffic, in erlangs, during busy hour. 

Amount of S/F traffic per busy hour that can be accommo- 
dated, in erlangs. 

Number of trunks in a trunk group. 

Number of trunks held in reserve. 

Average holding time of S/I* traffic. 

Average holding time of direct traffic. 

Ratio of the average holding time of direct traffic to the 
average holding time of S/F traffic, T'/to . 

Grade of service to direct traffic on lost-calls-cleared basis, 
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with c trunks in the trunk-group, offered load a of direct 
traffic, ratio of holding times 7, and s trunks in reserve for 

possible arrivals of direct traffic. 

Probability of exactly x trunks being occupied by direct 
traffic. 

Expected number of arrivals of direct traffic during the 
busy hour. 

Probability of, or state of, all-trunks-busy, with S/I* traffic 
on z trunks. 

Probability of, or state of, specified number of trunks (one 
when s = 1, two whens = 2) in reserve, with S/I* traffic 
on z trunks. 

Probability of, or state of, one trunk in reserve, with S/F 
traffic on z trunks (under the operating method where 
8s = 2). 

Number of trunks occupied by direct traffic. 

Number of trunks occupied by S/F traffic. 
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Over-All Characteristics of a TASI System 


By J. M. FRASER, D. B. BULLOCK, and N. G. LONG 
(Manuscript received September 19, 1961) 


TASI (Time Assignment Speech Interpolation) has been in service on 
transatlantic submarine cable channels since mid-1960. Measurement of 
service quality on one TASI system (White Plains—London) indicates 
that system performance equals or exceeds the original engineering objec- 
tives an all but a few cases. Field modifications now being made should 
bring these exceptions into closer agreement with objectives. 

A companion paper discusses in detarl the design considerations for 
TASI specch detectors and describes subjective tests made to determine the 
maximum permissible loading of TASI circuits without impairment of 
service. 


TASI, an abbreviation of Time Assignment Speech /nterpolation, is 
a high-speed switching and transmission system which uses the idle time 
in telephone calls to interpolate additional talkers.!? In a normal tele- 
phone conversation each subscriber speaks less than half of the time. 
The remainder of the time is composed of listening, gaps between words 
and syllables, and pauses while the operator or subscriber leaves the 
line. Measurements on working transatlantic channels, lig. 1, show that 
a TASI speech detector with a sensitivity of —40 dbm is operated by 
speech from one talker on the average about 40 per cent of the time the 
circuit is busy at the switchboard. Since long distance circuits use sep- 
arate facilities for the two directions of transmission, each one-way 
channel is, on the average, free about 60 per cent of the time. 

In order to take advantage of this free time to interpolate additional 
conversations, a considerable group of channels must be available. An 
attempt to interpolate two independent conversations on a single channel 
would result in a large percentage of the speech being lost, since the 
probability of both talkers speaking at the same time is high. However, 
with a large group of channels serving a larger group of talkers, the vari- 
ations in demand become much smaller. Even with 74 talkers on 37 
channels, the percentage of speech lost (freeze-out fraction) is reduced to 
a point where there is no noticeable effect on continuity of conversation. 
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Fig. 1 — Circuit activity. 


The increase in channel capacity with TAST is illustrated in Fig. 2, 
which gives the TASI advantage (ratio of switchboard positions, or 
trunks, to channels) for a range of activities and number of channels. 
A freeze-out fraction of 0.5 per cent has been assumed for each curve, 
since this amount of speech loss has been found from tests to have a 
negligible effect on transmission quality. 

It will be noted in Fig. 2 that a TASI advantage of at least two can 
be obtained on a 37 channel group as long as the average activity is not 
significantly greater than 40 per cent. TASI is designed to use 36 chan- 
nels for speech interpolation and one additional channel as a control 
channel for transmitting disconnect and error checking signals. This 
fits the needs of present day submarine cable systems, since 37 is close 
to the maximum number of channels that can be made available for 
TASI out of the total 48 channels derived by submarine cable type 
channel equipment? employing 3-kc filter spacing. The remainder of the 
channels are required for special services such as program material and 
certain types of data which are ordinarily not transmitted through TASI. 
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Fig. 2 — TASI advantage. 


The first TASI system was put in service in June, 1960 on the trans- 
atlantic cable system between White Plains and London (TAT-1) and 
the second followed a few months later on the cable between New York 
and Paris (TAT-2). TASI is well suited to submarine cable application 
for several reasons. Although TASI requires considerable expensive ter- 
minal equipment, it is an economical means of doubling the number of 
conversations that can be handled on expensive submarine cable facil- 
ities. In addition, TASI is easier to apply to long submarine cables than 
to the land plant with its many branching points and alternate routes. 

The principal purpose of this paper is to describe the application of 
TASI to the White Plains-London submarine cable system. The channel 
requirements that must be met for TASI operation are detailed along 
with measurements of the combined system characteristics, such as 
noise, bandwidth, etc. Measurements of the amount of speech lost in 
an actual working system are compared with earlier theoretical com- 
putations. 


I. DESCRIPTION OF OVER-ALL SYSTEM 


The message channels on the land portions of the TAT-1 system em- 
ploy standard 4-kc channel spacing, and the undersea channels are 
spaced at 3-ke intervals. The land and undersea-type channels are inter- 
connected at voice frequencies at Sydney Mines, Nova Scotia and Oban, 
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Fig. 3 — TASI equipment — one direction of transmission. 


Scotland. A typical message circuit between the TASI terminals at 
White Plains and London extends from about 275 to 3150 eps. 

Fig. 3 shows a block diagram of the TASI equipment for one direc- 
tion of transmission; an independent TASI is used in the opposite direc- 
tion of transmission. Presence of speech on a trunk causes the speech 
detector to operate, initiating a request for a channel. The transmitting 
common control equipment selects an idle channel, if one exists, and 
assigns it to the requesting trunk. Before the talker is connected to the 
channel, a “‘connect” signal is sent over the assigned channel specifying 
the trunk to be connected to that channel at the distant receiving ter- 
minal. During the time required to connect talker and listener the initial 
part of the talker’s speech is clipped. In order to minimize clipping the 
signaling time has been made as short as possible, 17 ms, consistent with 
reliable signaling and quiet switching. The signal information consists 
of a single burst of 4 tones out of a possible 14, ranging in frequency 
from 615 to 2419 cps. Once a talker is assigned a channel he does not 
lose the connection as long as he continues talking. When he stops talk- 
ing he may still retain the connection unless he has to be disconnected 
to provide a channel for another talker. A similar burst of 4 tones out of 
15 (615 to 2501 eps) is used to disconnect the talker and the listener, 
but this signal is sent over a separate control channel. During periods 
when no disconnect signals are being used, the same type of code sig- 
nals are used to send information over the control channel as to the 
trunk-channel connections existing at the transmitting end. This con- 
nection-checking information overrides any earlier information and 
determines the connection made at the receiver. In addition, a compari- 
son at the receiver between existing and OvERIGINE information is used 
to detect bad channels. 

~ As shown in I’ig. 2, the number of trunks which can be served by TASI 
depends upon the number of channels available. To prevent excessive 
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speech loss when the connecting channels fail, TASI has been designed 
to automatically remove bad channels from service; trunks are then 
removed until the proper trunk-channel ratio is reached. 

In addition to the provisions for automatically reducing the number 
of connected trunks and channels, TASI contains audible and visible 
alarms to identify internal failures. In the event of a major failure in 
TASI the terminals automatically switch themselves out at both ends, 
reducing the number of connected trunks to the number of available 
channels. TASI is also switched out automatically if both the regular 
and alternate control channels fail. If only the regular control channel 
fails, the disconnect and error checking signals are automatically switched 
to the alternate control channel and TASI will continue to operate with 
only a momentary interruption. 

When TASI is switched out, the voice-frequency amplifiers associated 
with TASI are also switched out. The schematic relationship of these 
voice-frequency amplifiers and other transmission equipment is shown 
for one terminal in Fig. 4 along with typical operating level points. The 
combination of VF amplifiers, TASI, and appropriate attenuation pads 
provides a zero-loss device and also provides optimum transmission levels 
to TASI. 

The echo suppressor shown in Fig. 4 performs the usual function of 
preventing echoes, generated at points of impedance mismatch, from 
reaching a subscriber’s ear and interfering with normal conversation. 
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Fig. 4 — TASI and associated equipment. 
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On TASI circuits, these suppressors must be of the receiving end split 
type at each end of the circuit to prevent the distant talker’s echo from 
operating the speech detector. The location of a split suppressor is shown 
schematically in lig. 4. Speech received from the distant party reaches 
the echo suppressor and operates it, causing a large loss to be inserted 
in the transmitting path before the speech detector. This large loss pre- 
vents most of the echoes capable of operating the speech detector from 
reaching the speech detector, but the initial part of the echo may get 
through because the existing echo suppressors were designed with a 
slow operate time to minimize operation on line noise. On the other 
hand, the speech detectors must be relatively fast operating to mini- 
mize initial clipping of speech. The result is that the operate time of the 
echo suppressors is about 7 ms longer than that of the speech detectors. 
During this interval the speech detector may be operated by echoes be- 
fore the echo suppressor can operate. To minimize this difficulty, and 
still use existing echo suppressors, the speech detectors are equipped with 
fast-acting circuits which reduce the sensitivity of the speech detectors 
as much as 13 db depending on the energy present on the receiving side 
of the trunk. This echo protector function of the speech detector reduces 
the probability of operating during the initial part of the spurt; the 
large loss inserted later by the echo suppressors prevents operation 
during the remainder of the spurt, which usually contains the higher 
energy. 

In addition to the equipment shown in I‘ig. 4, compandors can be 
applied to noisy channels as necessary to meet noise objectives. Tests 
have indicated that TASI can signal satisfactorily through Bell System 
1A-type compandors or their British equivalent. 


1.1 Toll Signaling and Supervision 


Because TASI is a time sharing device, there are problems involved 
in transmitting supervisory and dialing pulses. TASI can work satis- 
factorily with the present rigdown manual arrangement, but it is ob- 
vious that the usual method of continuous supervision by means of a 
steady tone during the idle time cannot be used. Likewise, dial pulses 
cannot compete for a TASI channel on the same basis as a talker, be- 
cause TASI clipping would cause signaling errors. A burst signaling sys- 
tem is required. 


Il. ENGINEERING OBJECTIVES FOR TASI 


In order that TASI could operate over existing telephone facilities 
and would fit in with existing performance standards, certain engineering 
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TaBue I — ENGINEERING OBJECTIVES FoR TASI 


Capacity 


Speech quality 


Signaling errors 


Reliability 


Frequency re- 
sponse 


Net loss 


Circuit 


Crosstalk 











At least 72 message trunks to be operated over 37, 3-ke 
spaced cable channels. If the number of available channels 
is less than 37, the number of trunks to be provided by 
TASI will be less, as illustrated in Fig. 2. If the total busy- 
hour speech activity is increased above about 40%, the 
maximum number of trunks to be provided will be less. 


With the TASI system fully loaded as defined above, the 
degradation to speech quality due to TASI should not 
exceed about 1 db. When the number of talkers equals the 
number of available channels, the TASI degradation 
should be close to 0 db. 


On the average, during the busy hour, no more than 0.01% 
of the talkspurts transmitted should be lost because of 
signaling errors if the transmission medium meets the 
objectives noted in Table II. Assuming that the average 
activity is 40%, this means about one talkspurt lost in 
thirty average 10-minute calls. 


The reliability objective is that the amount of time trunks 
are removed from service because of TASI failure shall be 
less than 0.1% of the total time. 


The TASI transmitter and receiver connected back to back 
should pass a band of 200-3500 cps. The average variation 
from flatness of all the channels should be within +0.5 db 
over this frequency range. In addition the standard devia- 
tion of the variations from the average should not exceed 
0.2 db. 


The net loss at 1000 eps through the TASI equipment alone 
should be adjustable to within 0.15 db of 0 db and should 
stay within +0.15 db of the adjusted value for at least 
one month. 


The noise generated by TASI in the transmission path should 
not exceed about 12 dba as measured at the zero level 
points. 


To provide adequate crosstalk performance, an equal level 
coupling loss of 70 db should be obtained between talking 
paths in TASI. This applies to both near-end and far-end 
crosstalk. 





objectives were set up to guide the planning and development of TAST. 
They were considered as reasonable goals rather than rigid requirements. 
These objectives are listed in Table I. 


III. CHARACTERISTICS OF CHANNELS FOR TASI 


Because of the high-speed signaling used in TASI and because sub- 
scribers are switched rapidly between channels, the transmission re- 
quirements of the channels connecting TASI terminals are somewhat 
tighter than required for the usual telephone message service. The char- 
acteristics of importance to TASI are listed in Table II. 
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TABLE II — RequirED TRANSMISSION CHARACTERISTIC OF 
CONNECTING CHANNELS FoR TASIT 





Minimum bandwidth 


Flatness of band 565 to 2550 
cps 


I-ke net loss value of any 
channel 


Iinvelope delay distortion 
565 to 2550 cps 


Flat delay 


RMS noise 


Crosstalk 


Frequency stability (565 to 
2550 cps) 

Working levels for TASI 
(excluding pads or ampli- 
fiers outside of TASI) 








300 eps-2900 cps (10-db cutoff frequencies). 


Difference between maximum and minimum loss 
should not exceed 2.5 db. 


Not more than +3 db from the nominal value. 
Not greater than 2 ms. 


Maximum difference between channels should be 
no more than 10-15 ms at 1000 eps. (The control 
channel should be one of the fastest.) 


Without compandors, 38 dba at zero transmission 
level (88 dba 0). With compandors, noise on line 
ahead of compandors should not exceed 51 dba 0. 

Difference in output noise between channels 
should not exceed 6 db. 


Equal level crosstalk loss on all channels should 
be at least 60 db. 


No frequency shifted more than 2 eps. 


Transmitting terminal input, —2 with respect to 
zero transmission level point. Receiving ter- 
minal input, +4 with respect to zero transmis- 
sion level point. 


IV. MEASURED TRANSMISSION CHARACTERISTICS 


After TASI was installed on TAT-1 extensive measurements and tests 


were made to determine how closely TASI came to mecting the engineer- 
ing objectives and how TASI operated in the environment of the tele- 
phone plant. The results of the tests and measurements are given briefly 
in the following paragraphs. 


4.1 Frequency Response 


Fig. 5 shows the frequency response of a TAT-1 message channel, 
with and without TASI. Figure 5 applies to most connection channels, 
although a few channels differ significantly due to channel bank and 
pilot filters. The sharp cutoff frequencies of the channel, about 275 and 
3150 cps, are principally due to the 3-ke submarine cable terminal 
equipment. It can be seen that TASI does not affect the frequency re- 
sponse of the system appreciably. 
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Fig. 5 — Frequency response — TAT-1. 


4.2 Net Loss Variations 


The net loss variations on TAT-1 are kept to a minimum through the 
use of pilot-controlled automatic gain controls. An associated alarm re- 
porting system informs all important stations automatically if the pilot 
levels deviate beyond prescribed limits. 

Fig. 6 shows a distribution of the difference in net loss of successive 
talkspurts experienced by a typical White Plains-London subscriber 
talking through TASI during the busy hour. These changes are due to 
differences in the net losses of the channels and the various paths through 
TASI. Only 8 per cent of the changes were greater than about 3 db, 
which is just noticeable. There were no changes greater than about 5 
db. During periods of light traffic, the number of switches between 
channels, and the number of net loss changes, will decrease and cease 
entirely if traffic is very light. 


4.3 Envelope Delay Distortion 


A fixed delay equalizer was added to each of the TAT-1 message chan- 
nels to reduce the delay distortion. In addition, the voice-frequency 
interconnections between land and undersea channels were arranged 
to avoid combinations having excessive delay distortion. The median 
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Fig. 6 — Net loss differences. 


value of delay distortion of the resulting White Plains-London message 
channels is under 0.5 ms, with no channel being worse than 1.6 ms, within 
the TASI signaling band (565 to 2550 c/s). 

lig. 7 shows the delay distortion of a typical message channel before 
equalization, and after the addition of equalizers plus TASI. Channels 
located in the frequency spectrum near the cutoffs of group connector 
filters and pilot-frequency filters have delay distortion characteristics 
significantly different from the example on Fig. 7, particularly at the 
lower and upper edges of the passband. 


4.4 Nozse 


TASI contributes very little to the over-all system noise; the average 
of the TAT-1 channels is about 36 dba at zero transmission level (ab- 
breviated “36 dba 0”), and the average noise generated by TASI plus 
amplifiers is less than 12 dba 0. Since noise is increasing slowly on the 
transatlantic circuits caused by the change in cable characteristics with 
time, compandors have been installed on some channels. This will not 
influence the operation of TASI. 
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Fig. 7 — Envelope delay distortion typical TAT-1 Channel. 


When a channel is disconnected during a conversation, the listener 
may notice a change in the background noise, because he no longer hears 
the channel noise of the whole system but instead hears only the noise 
of the trunk connecting him to TASI. To prevent the subscriber from 
feeling that he has lost his circuit due to the sudden noise change be- 
tween successive connections, the receiving TASI terminal transmits 
random noise of about 33 dba 0 toward the listener whenever his trunk 
is not connected to a channel. 


V. SPEECH CLIPPING 


In addition to the measurements described above, measurements were 
made of the amount of speech clipped from subscribers making calls 
through the TAT-1 TASI system during the busy hour. The measured 
values were compared with computed values which were based on meas- 
ured speech activities and talkspurt lengths. 

The computations took into account the two major components of 
speech clipping in TASI, which are: 

1. Signaling clipping, which is the time lost (17 ms) while a new con- 
nection is established; 
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2. Freeze-out, which is the time lost because no channels are available. 

The length and frequency of these clips will vary from eall-to-call due 
to loading, speech habits, or statistical chance. 

In addition to signaling and freeze-out, some clipping is also caused 
by (a) speech-detector response time and threshold; (b) disconnection 
delays caused by control channel crowding during heavy loading. How- 
ever, tests have shown that the TASI speech detector introduces neg- 
ligible speech impairment, and as described later it is estimated that 
control channel overload contributes very little clipping. 

The results of the computations for various trunk-channel combina- 
tions are shown in Fig. 8 for the median call, and in lig. 9 for the worst 
1 per cent case. TASI can operate with a maximum of 36 speech channels 
plus one for control, but because of the demands for special services, 37 
channels are not available on all systems for TASI. Results are shown, 
therefore, assuming different numbers of channels are available to TASI. 

The right-hand scale in Iigs. 8 and 9 gives the estimated db impair- 
ment corresponding to the speech loss shown by the left-hand scale. The 
upper solid curves assume that during the busy hours the circuits are 
carrying calls 100 per cent of the time (A = 1.0); the lower dotted curves 
assume that over the busy hours the average loading of the circuits is 
0.85. 

As shown in Fig. 8, for example, a subscriber in a 74 trunk, 36 + 1- 
channel fully-loaded system will have 1.1 per cent or less of his speech 
clipped during 50 per cent of his calls; ig. 9 shows that this same sub- 
scriber will lose 2.9 per cent or less of his speech during 1 per cent of his 
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Fig. 8 — Median speech clipping. 
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Fig. 9 — Upper 1 per cent speech clipping. 


calls. In a TASI system with an average loading of 0.85, the median 
clipping would be 0.6 per cent or less, and the upper 1 per cent clipping 
would be 1.6 per cent or less. The loading factor varies with season, world 
events, etc. In order to be conservative, and to provide good quality 
speech even during peak periods, the recommended trunk-channel ratios 
shown in Table III assume 100 per cent loading of all trunks. 


TasieE III — RECOMMENDED OPERATING CONDITIONS 





Approximate percentage of speech clipped 








Number of connection channels | Normal max. no. of talker 
available to TASI trunks 
Median-% 1-% 
W+1 47 1% 3% 
28+1 56 1 3 
32 + 1 65 1 3 
36 + 1 74 1 3 








The dots in Figs. 8 and 9 for 47 trunks on 24 + 1 channels represent 
measurements made during the busy hours on the TAT-1 TASTI sys- 
tem. Since the results lie between the A = 1 and A = 0.85 computed 
values, the indications are that the loading of this system, during the 
busy hours, is between 0.85 and 1.0. Direct measurements of circuit 
usage were made and confirm that the average lies in this range. 

Fig. 10 shows the complete distribution of the measured speech 
loss during subscriber calls caused by signaling clipping and freeze-out 
on TAT-1. Both computed and measured curves of the two components 
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Fig. 10 — Estimated and measured clipping distributions on a 24 + 1 channel 
47 beat TASI system. 


are approximately log normal. The good agreement shown in Fig. 10 
between computed values (assuming A = 0.85) and the measured values 
indicates that with respect to freeze-out fraction the operating TASI 
behaves statistically as expected. 


VI. CLIP LENGTHS 


Another important characteristic of TASI clipping is the distribution 
of the lengths of clips a talker experiences. The length of clip a talker 
receives whenever he receives a new connection is composed of two 
major parts; the constant signaling clip (17 ms) and the freeze-out, 
which may vary from 0 ms to 500 ms (upper limit for all practical pur- 
poses), depending upon the instantaneous load. The computed distribu- 
tion of clip lengths a subscriber may experience in a 47/24 + 1 TASI 
is shown in I’ig. 11 for A = 0.85 and A = 1 together with measured 
values on the New York—London TASI system. The measured 1 per 
cent clip length in this system was about 60 ms. Here again the meas- 
ured values lie between A = 0.85 and A = 1, indicating the actual load- 
ing was between these two values. While the computed distributions 
are for a 47/24 + 1 TAST installation, the distributions apply fairly 
well to all the trunk-channel ratios shown in Table III. 
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Fig. 11 — Measured and computed clip length distributions for a 24 + 1 chan- 
nel 47 trunk TASI. 


VII. CONTROL CHANNEL CAPACITY 


As mentioned earlier, control channel crowding under heavy loading 
conditions can delay disconnections and in some cases increase clipping 
by delaying subsequent connections. 

In a 74 talker, 36 + 1 channel TASI system in which each spurt re- 
quires a new connection, it is estimated that disconnects are delayed, on 
the average, about 4 ms each. This results in an estimated increase in 
average freeze-out fraction of less than 0.2 per cent. Since a working 
system switches less often than every spurt — a fact confirmed by meas- 
urement — the actual effect of control channel crowding is believed less 
than the above estimate indicates. Although in some rare cases bunching 
could seriously effect individual talkers, it appears that control channel 
overload contributes negligible clipping. 


VIII. CONCLUSIONS 


TASI has been operated successfully on submarine cable systems to 
provide approximately twice as many good quality message trunks as 
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existed before TASI. The measurements made on an installed TASI 
system have shown that TASI has come close to meeting its engineer- 
ing objectives and that field modifications now going on will bring closer 
agreement between performance and objectives. The close agreement 
between computations of speech clipping and the measured values show 
that TASI theory is well understood and that TASI in the field is con- 
forming closely to theory. 
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TASI Quality — Effect of Speech 


Detectors and Interpolation 


By H. MIEDEMA and M. G. SCHACHTMAN 
(Manuseript received September 19, 1961) 


This article describes tests made to select design parameters for the 
speech detectors in the TASI system. Results of subjective tests carried out 
to determine maximum permissible loading of TASI circuits during busy 
hours are also described. Finally, conclusions drawn from observations on 
a working TASI system are given. These observations indicate that TASI 
ws a more satisfactory method of increasing transatlantic cable capacity 
than alternate methods, such as the use of 2-kc channel banks. 


I. INTRODUCTION 


TASI ( Time Assignment Speech Interpolation) is a new component 
in the telephone system that can approximately double the message 
capacity of existing long submarine cables. With TASI many calls share 
the same facilities, each requiring an available channel only when speech 
is transmitted. In order to recognize that speech is being transmitted 
by the subscriber, a highly sensitive speech detector is required. To 
assign the speech to an idle channel and to connect the proper talker 
and listener at each end requires a rapid switching system. A description 
of the switching system and other related matters can be found in other 
sources.!?3 This paper deals with: (a) the work carried out to select the 
parameters of a speech detector satisfactory for TASI operation and 
(b) the results of subjective tests made to determine the approximate 
effect of the type of speech clipping that can occur in a fully loaded TASI 
during busy traffic periods. 

Some speech is lost whenever the number of individuals talking or 
starting to talk in one direction on TASI circuits exceeds the number of 
available channels. The amount of lost speech must be kept small so 
that the transmission quality is not affected appreciably. On the aver- 
age, less than 0.5 per cent of the total speech is lost due to interpolation, 
as long as the number of calls in progress is held to no more than twice 
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the number of channels. The effect of this loss on transmission quality is 
practically negligible. 

In addition to the speech lost through interpolation, some speech is 
lost during the time required to connect a talker and his listener at the 
other end to the assigned channel. In TASI the switching time (17 milli- 
seconds) has been kept as short as possible, consistent with reliable 
signaling. During the busy hours a subscriber will be switched about 
every second talkspurt; however, during occasional periods of peak load, 
the subscriber may be switched almost every talkspurt. Consequently, 
17 milliseconds will be clipped from a large number of talkspurts during 
the busy hours. In order to minimize the amount of speech lost due to 
switching time, TASI has been designed so that a subscriber loses his 
channel only when it is not needed by that subscriber and when it is 
required by another talker. 

A third possible source of lost speech results from the operate time of 
the speech detector. Whenever a talker has to be reconnected to a 
channel, the speech detector must recognize that speech is present and 
initiate the proper action. The interval between the time that the speech 
starts and the speech detector reacts adds to the amount of lost speech. 
The operate time of the speech detector can be kept small compared 
with the clipping caused by interpolation and connect signaling, but it 
should not be made so fast that the detector operates too often on noise. 

In order to determine suitable speech-detector characteristics, sub- 
jective laboratory tests and field measurements were made on working 
transatlantic circuits. Since the demand for transatlantic circuits ex- 
ceeded the capacity of the existing cable facilities, the schedule for 
developing TASI was of necessity very short for such a complex system. 
The tight schedule limited the type and length of test to the minimum 
needed for reasonable assurance that a satisfactory speech detector 
could be built. Consequently, a straightforward voltage threshold de- 
tector was chosen instead of a more complicated type. The first part of 
this paper describes the test results that led to the selection of the speech- 
detector characteristics. The second part of this paper describes the 
results of subjective tests to determine the impairment in speech quality 
caused by various amounts of lost speech. This work was needed as a 
guide to the maximum number of circuits that can be assigned to TASI 
without affecting speech quality adversely. 


II. SPEECH DETECTOR CHARACTERISTICS 


The ideal speech detector for TASI should operate only when speech 
is present and should not operate when noise and extraneous signals 
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are present. A practical detector must represent a compromise between 
ideal operation on speech signals and ideal rejection of noise signals. In 
addition, the activity, or percentage of the total time that a detector 
is operated, must be minimized. The parameters of the detector were 
chosen to insure that it 

t. operates when very low levels of speech are present, 

av. operates 2 Minimum amount of time on line noise, and 

vii. minimizes the number of times a subscriber must be switched, 
consistent with allowing twice as many calls as there are channels avail- 
able. 

The speech-detector parameters are interrelated and one parameter 
could not be selected without considering the effect on all other parame- 
ters. In order to choose the best combination, a series of subjective tests 
were made in the laboratory to find the speech-detector characteristics 
that provided good results under simulated plant conditions. Later, 
field measurements were made during a large number of transatlantic 
telephone calls to determine the performance of several possible speech 
detectors under actual plant conditions for a wide variety of talkers. 
The field tests measured speech activities, talkspurt lengths, and number 
of talkspurts. The combined results of the laboratory subjective tests 
and field measurements led to the choice of speech-detector character- 
istics shown in Table I. Each of these parameters will be treated separ- 
ately. 

When the speech detector is made too sensitive, the detector operates 
on noise and thereby reduces the possible TASI advantage. Conversely, 
when the speech detector is not sensitive enough, part of the first syllable 
is lost before the detector is operated. A sensitivity of the TASI speech 
detector of —40 dbm at zero transmission level when combined with 
an adequate speech detector hangover (slow release time) results in 
satisfactory speech quality with volumes as low as —31 vu and also 
results in minimum false operations due to noise. As shown on Fig. 1, 
a —3l1 vu talker has a lower volume than almost all talkers on trans- 
atlantic calls. 


TasBLE I — TASI SpesecuH DETECTOR CHARACTERISTICS 





1000-cps sensitivity —40dbm at zero transmission level point* 
Frequency range 500-3000 cps 

Operate time 5 milliseconds 

Hangover (release time) 240 milliseconds ‘‘deferred”’ 

Echo suppression Maximum 13 db 





* The zero transmission level point is a point to which all level points in a toll 
system can be referred. It is analogous to citing altitude by referring to height 
above sea level. The zero level point is at the transmitting toll switchboard of the 
system under consideration. 
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If the speech detector responded only to speech power within a very 
narrow frequency band, the problem of noise activity would be greatly 
reduced. However, for a speech detector with a fixed sensitivity to 
recognize initial consonants from many talkers, it is desirable to have a 
reasonably wide frequency band. Tests have shown that a bandwidth 
of approximately 500 to 3000 eps is suitable for telephone speech. If the 
bandwidth were extended below 500 eps or above 3000 eps, noise opera- 
tions would be increased and the initial speech power seen by the de- 
tector would not be increased sufficiently to permit an offsetting decrease 
in sensitivity. 

To operate the speech detector the power on the line must remain 
above —40 dbm for about 5 milliseconds. Laboratory experiments have 
shown that when the operate time is made as fast as possible, the detector 
sensitivity for equal quality speech can be decreased to about —37 dbm. 
However, the faster operate time results in increased operation of the 
speech detector by noise spikes which increases the activity even with 
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Fig. 1 — Subscriber speech volume distribution on transatlantic telephone 
circuits. 
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the lower sensitivity. Alternatively, an increase in operate time from 5 
to 10 milliseconds requires an additional 3 db increase in sensitivity 
(—43 dbm) with no significant reduction in noise activity. 

The sensitivity threshold of —40 dbm used for the detector is sub- 
stantially above the threshold of hearing; hence, noticeable dropouts may 
occur when the speech power is below the sensitivity threshold. The ear 
is particularly sensitive to the loss of weak syllables as well as clips within 
words and closely connected phrases. Consequently, the speech detector 
should not release until the speech power has remained below the thresh- 
old for a period of time that is comparable to the time of one additional 
syllable. The hangover required for satisfactory transmission of low 
speech volumes varies with sensitivity and amounts to about 240 milli- 
seconds for a —40 dbm detector. A much shorter hangover results in a 
higher TASI switching rate which increases the amount of speech lost by 
connect signaling clipping; a longer hangover results in increased speech 
activity and higher interpolation speech loss. Listening tests have 
shown that good transmission quality for weak talkers can be obtained 
with the combinations of sensitivity and hangover shown in Fig. 2. 

Peaks of noise higher than —40 dbm may operate the speech detector 
and hence add to the circuit activity. Most of the noise peaks last less 
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Fig. 2 — Variation of hangover with sensitivity for constant speech quality. 
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Fig. 3 — Speech-detector hangover vs signal duration. 


than 5 milliseconds and are virtually eliminated by the 5-millisecond 
operate time of the speech detector. Once the speech detector is oper- 
ated, however, the circuit cannot be released until the hangover time 
has elapsed. The effects of those noise peaks, which last from 5 to about 
50 milliseconds, are minimized by the use of the deferred hangover 
characteristic shown in Fig. 3. The minimum hangover time is about 25 
milliseconds. Noise peaks lasting substantially longer than 50 milli- 
seconds are indistinguishable from speech syllables and operate the 
circuit to the extent of the full hangover of 240 milliseconds. The 
combination of 5-millisecond operate time, —40 dbm sensitivity, 240 
milliseconds deferred hangover, and 500 to 3000 cps frequency range is 
about optimum for the expected telephone speech and noise levels. 

In the preceding sections only the operation of the speech detector by 
normal speech and noise incoming to the TASI system has been consid- 
ered. In the actual telephone plant, another group of unwanted signals, 
called echoes, can result in false operations of the detector. Transatlantic 
cable circuits have a one-way delay of about 40 milliseconds and this 
amount is sufficient to require echo suppressors in the four-wire part 
of the plant to prevent subscribers from hearing the echo returned from 
the two-wire part. The echo suppressors have an operate time of about 
12 milliseconds to minimize false operation by noise. At times, the slowly 
operating suppressor will permit small bursts of echo to get past the 
suppressor at the beginning of talkspurts from the distant terminal. 
The distant listener is unaware of these short echoes which are therefore 
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Fig. 4 — Simplified diagram of TASI terminal 


of no consequence in non-TAST circuits. However, on TASI-equipped 
circuits, these short echoes can operate the faster acting (5 milliseconds) 
speech detector and thereby use valuable channel time. The echo path 
is shown in Fig. 4 for an individual trunk in which the TASI terminal is 
represented in simplified form. 

To prevent these short echoes from operating the detector, an echo 
protecting circuit was added to the speech detector and connected to the 
receiving trunk output. When the distant talker is active, his speech 
reduces the sensitivity of the near-end speech detector at a uniform rate 
up to a maximum sensitivity decrease of 13 db as shown in I'ig. 5. This 
value was chosen after consideration of the existing return losses in the 
plant and of the characteristics of the echo suppressors. The objective 
was the elimination of nearly all echo operation under all operating 
conditions. Measurements on many calls have confirmed that echo oper- 
ation of the speech detector is negligible. 


II. LABORATORY TESTS AND FIELD MEASUREMENTS ON SPEECH DETECTORS 


From the foregoing description of the interaction of speech-detector 
parameters, it is apparent that there are several combinations which will 
result in a speech detector capable of recognizing the presence of speech 
and acting on this signal with hardly any effect on the speech quality. 
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Fig. 5 — Effect of echo protection circuit. 


Listening tests in the laboratory were made to determine the speech- 
detector sensitivity required for each type of detector over a range of 
speech levels. These tests were made using tape recordings of a number 
of voices, both male and female. 

Speech from the tape recorder operated the speech detector, which in 
turn operated a gate to connect the observers to the tape recorder. The 
arrangement was such that observers could hear the recorded speech 
only when the detector was active. The observers were selected from Bell 
Telephone Laboratories personnel, both technical and clerical. They 
were asked to determine the minimum sensitivity required for accept- 
able speech quality. The rapid deterioration of speech quality when the 
speech-detector sensitivity was reduced below a certain minimum made 
this adjustment critical for each observer and resulted in a reproducible 
relation between speech-detector sensitivity and speech level. Whilc 
each observer had a well defined tolerance level, the variation of the 
results among individuals was large. The sensitivity selected as satis- 
factory for a given speech detector was the value that satisfied 50 per 
cent of all observers for the minimum speech level of —31 vu at the 
zero transmission level point. 

After determining several combinations of speech-detector character- 
istics that resulted in equal speech quality, various speech detectors 
were connected across transatlantic trunks to determine for each call 
the number of operations and the activity. The average activity of trans- 
atlantic subscribers, determined from measurements on many calls, 
determines to a large extent the possible TASI advantage (ratio of trunks 
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Fig. 6 — Average speech-detector activity during busy hour on transatlantic 
telephone circuits. 


to channels). The number of operations affects how often a subscriber 
is disconnected from and reconnected to the channels. The results of the 
field measurements are shown by the four sloping lines on Fig. 6. On the 
same figure, curve A indicates the locus of points for constant and ac- 
ceptable quality from Fig. 2. This locus permits the selection of the 
detector that provides acceptable quality and minimum activity. It will 
be noted that the point of minimum activity on curve A is not very 
sharply defined. Two speech detectors, points N (240 milliseconds hang- 
over) and M (125 milliseconds hangover), were selected for further 
study. The average activity and talkspurt length resulting from each 
of these detectors were obtained from the measurements on transatlantic 
calls and are summarized in Table IT. 

It will be noted in Table IT that the activity from the 240-millisecond 
detector is higher than for the 125-millisecond detector. However, the 
talkspurt length for the customer using the 240-millisecond detector is 
almost twice as long as for the 125-millisecond detector. As a guide in 
the choice between these detectors, computations were made to deter- 
mine the total amount of speech a subscriber would lose during the busy 
hour if one or the other detector were used in TASI. The computed 
speech loss (interpolation plus switching) associated with each detector 
is shown on Fig. 7. The total speech lost is slightly less for the 240-milli- 
second hangover detector. Although use of the 240-millisecond de- 


1464 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1962 


TABLE IJ — MEASURED ACTIVITIES AND TALKSPURT LENGTHS FOR 
TRANSATLANTIC SPEECH ORIGINATING IN U.S. 


125-millisecond Detector 240-millisecond Detector 
Activity 
Operator 23% 24% 
Subscriber 47% 48% 
*Average during busy hour 88% 39% 


Duration of average detector operation 


Operator 0.4 second 0.7 second 
Subscriber 0.8 second 1.3 second 
*Average during busy hour 0.6 second 1.1 second 


* The results for operators and subscribers are combined in the ratio of 0.375 
to 0.625, respectively. This ratio was found to be approximately the division of 
operator and subscriber time on transatlantic trunks during the busy hour. 
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Fig. 7 — Subscriber speech losses caused by TASI signaling and interpolation. 


tector results in a higher activity than the 125-millisecond detector and 
a slight increase in interpolation clipping, these disadvantages are more 
than offset by the fewer switching operations. 


IV. SUBJECTIVE EVALUATION OF CLIPPING 


The preceding tests showed that a satisfactory speech detector could 
be built whose impairment would be negligible. Additional tests were 
needed to determine how much clipping of all types could be tolerated 
without a significant impairment in speech quality. These quantitative 
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tests were needed in order to specify the maximum number of circuits 
that could be used with TASI. 

Under normal operation, TASI clipping consists of many segments 
which are short compared with an average syllable length. During the 
busiest hours of the day, about one-half of the talkspurts will experience 
some clipping but in most cases, it will be limited to the 17-millisecond 
switching clip. Only one talkspurt in 10,000 will be clipped longer than 
about 0.4 second. Since the listener rarely misses a syllable, the result is 
a slight abruptness in speech which has a negligible effect on intelligi- 
bility. 

The initial series of subjective tests to evaluate clipping in TASI was 
conducted before an actual TASI terminal became available. Preselected 
clips of a fixed length were introduced at the beginning of every Kth 
talkspurt, where K was 1, 2, --- , etc. By eliminating the randomness 
of TASI, the testing time was shortened and the data could be analyzed 
to discover whether the most significant factor affecting the impairment 
of TASI was the total speech lost or the pattern of clip occurrence. In 
this series of tests, the percentage of total speech time lost by clipping 
was varied from 0 to 6 per cent, which encompassed the expected range 
of TASI clipping. A group of observers placed normal business calls 
over circuits in which the amount of clipping would be controlled and 
measured. The observers were asked to base their replies solely upon 
quality considerations and to rate the calls as either ‘‘good,” “fair,” or 
“poor.” The testing methods are described more fully in the Appendix. 

The results of the clipping tests are shown on Fig. 8. Each point rep- 
resents the percentage of calls rated ‘‘good” out of groups of more than 
75 test calls for each condition. The left-hand scale indicates the amount 
of impairment found to elicit the corresponding percentages of ‘‘good”’ 
responses in a previous study of other degradations.* The curve was an 
approximate fit with a second-order polynomial using the least squares 
method. 

Impairment in db has the following meaning: if a telephone subscriber 
is given the choice between the transmission system under test and a 
reference transmission system, how much loss could be inserted in the 
reference path for the subscriber to rate the circuits as equivalent? The 
loss inserted is defined to be the impairment, and is usually expressed in 
db. This impairment has been determined in the past for such things as 
noisy systems, restricted bandwidth, etc., but has not been determined 
directly in this study. The rendering of the present data into db of im- 
pairment rests on the assumption that “percentage good’? means the 
same thing in this and the Coolidge-Reier study. This assumption is 
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AVERAGE IMPAIRMENT IN DECIBELS 
PER CENT OF CALLS RATED GOOD 








FRACTION OF SPEECH LOST IN PER CENT 


Fig. 8 — Subjective impairment vs fraction of speech lost. 
Note: designation 30,2 means a 30 ms clip every second talkspurt. 


unlikely to hold with great precision, so the impairment figures given 
here (and the implied comparability with other impairments) must not 
be considered precise quantities. This reservation also applies to similar 
conversions referred to later in this paper including those involving 
intelligibility figures. The use of such impairments provides a common 
denominator for comparing the effect of these different forms of speech 
degradation. 

After the initial series of tests had been completed, another series of 
tests expanded the measurements to still greater amounts of clipping, 
in order to evaluate the loss in intelligibility that might occur with un- 
usually high TASI loadings. The second series of tests used articulation 
methods and measured the degree of successful communication rather 
than simply quality as before. By this time a working TASI terminal 
was available so the occurrence and duration of the clipping were essen- 
tially the same as in normal TASI operation. The tests used ‘‘phonet- 
ically balanced” words and both the articulation in percentage of correct 
words and the percentage of lost speech were measured. The results of 
three articulation tests for widely different amounts of lost speech are 
shown by the dots on Fig. 9. The upper dotted curve is based on the 
three articulation tests, while the lower dotted curve has been taken from 
Fig. 8. The combined result is indicated by the solid line. The translation 
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Fig. 9 — Subjective impairment vs fraction of speech lost. 


from the percentage of correct words to db impairment made use of 
previously published results. The testing methods are described more 
fully mn the Appendix. 

The foregoing tests indicated that the impairment is determined for 
the most part by the percentage of speech lost, at least for the case of 
normal TASI clipping which has certain definite characteristics: the 
clips are of short duration, they occur only at the beginning of talk- 
spurts and they are irregularly distributed throughout a call. Hence, 
these results do not necessarily apply to the case where long clips of 
many syllables occur very infrequently. 

The results on Fig. 9 can be applied to a statistical analysis of TASI 
clipping to obtain an estimated impairment for an actual TASI system. 
Fig. 10 shows the probable busy-hour impairment for a typical trans- 
atlantic TASI system, which uses 36 interpolation channels plus one 
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channel for disconnect and error checking signals. The estimates on Fig. 
10 are based on the amount of lost speech which has been computed 
from appropriate distributions of subscriber activities and talkspurt 
lengths as a function of the number of trunks available for service. 

The median curve indicates that in 50 per cent of the calls, the im- 
pairment caused by TASI will be less than 2 db as long as the number 
of active circuits is no more than 74 trunks on 36 + 1 channels. How- 
ever, the impairment amounts to about 4 db for one call in a hundred 
and to about 6 db for one call in 10,000. For comparison purposes, 
the approximate impairments of 2-ke and 3-ke underseas channel banks 
are also indicated on Fig. 10. It will be noted that on nearly every call, 
a 2:1 TASI advantage causes less busy-hour impairment than would 
have been obtained through the use of 2-ke channel banks. In addition, 
the TASI impairment decreases rapidly as the number of talkers de- 
creases during non-peak periods, while the impairment caused by narrow 
channels is independent of the traffic load. 


V. DETERMINATION OF THE GRADE OF SERVICE 


The foregoing results are presented in terms of db impairment to 
permit an easy comparison between TASI and alternate methods of 








IMPAIRMENT IN DECIBELS 








AVAILABLE TRUNKS, 36+1 CHANNEL SYSTEM 
Fig. 10 — Busy-hour subscriber impairment due to TASI. 
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obtaining additional channels. It is also of interest from an operating 
standpoint to estimate the over-all grade of service on transatlantic calls 
when the additional practical factors such as variations in net loss, band- 
width and talker volumes are included. For more accurate results, the 
effect of each component should be investigated under all possible con- 
ditions of the other factors such as noise, volume bandwidth, etc., in 
order to look for possible interaction effects. However, in order to find 
the small effects looked for in this study, many samples would be re- 
quired for each combination, and the amount of work involved would 
be almost prohibitive. Past experience indicates that a rough estimate 
of the over-all impairment can be obtained by adding together the in- 
dividual impairments in db and then expressing the end result in the 
“good,” ‘fair,’ and “poor” ratings. 

On this basis the estimated grade of service on transatlantic calls with 
TASI applied to 36 3-ke interpolation channels is shown on Fig. 11 for 
loading conditions that are equal to or greater than the normal operating 
condition of 74 trunks on 36 + 1 channels. For comparison the estimated 
grade of service is given for 3-ke underseas channel banks used without 
TASI as well as the desired objective for all long-distance circuits. While 
this objective is not fully met at present, it is expected that the TASI 
grade of service under normal operation will be improved and will ap- 
proach the desired objective as the result of the current program to im- 
prove local plant transmission. Figures 10 and 11 also indicate that dur- 
ing emergency peak periods, TASI can provide a greater number of 
higher quality circuits than can be realized with 2-ke channel bank 
equipment. 


VI. SUBSEQUENT OBSERVATIONS OF TASI IN SERVICE 


After TASI systems were put in service on transatlantic telephone 
cables, service observations were made on several TASI circuits as well 
as on a reference non-TASI circuit. The results indicated that the per- 
centage of calls rated ‘‘good’’ by a qualified service observer was prac- 
tically the same for TASI and non-TASI trunks. 

The grade of service for greater than normal TASI loading was meas- 
ured when additional trunks were utilized for emergency service at the 
time of a break in one of the transatlantic cables. One transatlantic 
system was utilized with TASI to carry traffic normally carried by both 
transatlantic systems. Table III gives the grade of service for normal 
loading and for two conditions of greater than normal loading. With 
the exception of a somewhat greater “‘poor’’ rating for 90 trunks, the 
observations on operating circuits correspond to those predicted in Fig. 11. 
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Fig. 11 — Grade of service — TASI plus 3-ke channels. 
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Trunks on 36 + 1 channels} Per cent Rating ‘“‘good’’ | Per cent Rating ‘‘fair’’ | Per cent Rating ‘‘poor’’ 








74 90 9 I 
84 87 11 2 
90 74 16 10 








VII. CONCLUSION 


The data from these service observations combined with the other 
results presented in this paper give assurance that most telephone users 
making calls through a TASI system with normal load will not be aware 
that their conversation is being interpolated. This holds even during the 
busy period when any degradation that does exist is at a maximum. 
During emergency conditions when service is partially disrupted, TASI 
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provides additional circuits with only a moderate decrease in grade of 
service. This is felt to be preferable to long service delays. 
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APPENDIX 
Method of Conducting the Tests 


A.1 Initial Series of Conversational Tests 


The random nature of speech and clipping, and the dependence of the 
subjective ratings on the environment in which the clipping occurs in- 
dicated that the tests of clipping should be conducted under the most 
representative conditions. To implement this requirement, observers at 
Bell Telephone Laboratories engaged in normal telephone conversations 
in which the TASI clipping was introduced and the bandwidth and net 
loss were those of a typical transatlantic circuit. These calls were from 
the desks of the observers and consisted of normal business traffic that 
corresponded well with typical traffic on transatlantic calls. Following 
each call they rated the quality of the circuit as “good,” ‘fair,’ or 
“poor.” A total of 45 hours of test calls made by 25 talkers was analyzed. 

The results of the tests were analyzed and yielded the percentage of 
all test calls rated “good” in over-all quality of transmission. The rating 
of per cent “good” for the various clipping conditions was converted to 
db impairment by the previously mentioned relation between subjective 
ratings and received volume. This step assumes that the impairment is 
roughly equivalent to the effect of a decrease in received volume on a 
transatlantic circuit. 

The individual data points are plotted in Fig 8 and the resulting 
curve is repeated as the lower portion of the curve of Fig. 9. The percent- 
age of speech lost is determined by the length of the clip, the frequency of 
occurrence (every Kth talkspurt) and the average talkspurt length. 
Tor these tests, the 125-millisecond hangover detector was used which 
results in an average talkspurt length of about 0.5 second. Thus, for 30 
milliseconds, K = 2, a 30 millisecond clip occurs on every second talk- 
spurt of length 0.5 second, and this condition results in a percentage 
speech loss of 3 per cent. As indicated in Fig. 8, speech losses of 3 per 
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cent and 6 per cent resulted from more than one test condition. These 
different conditions were used to determine whether the manner in 
which a given percentage speech loss occurs results in a significant vari- 
ation in the impairment. Such a significant variation is not apparent 
from the results. Rather, the spread that occurred can be attributed 
more to variability in the testing than to the different ways of producing 
clipping. While each point represents more than 75 test calls rated by 
observers, the estimated confidence interval is only +2 db because only 
3 separate ratings were possible. The solid curve has been used for TASI 
engineering but any use for other purposes should take into account the 
testing methods and the variations that are inherent in such subjective 
tests. 


A.2 Articulation Tests 


Articulation tests were conducted to obtain information about the 
effect of clipping when the amount of lost speech was great enough to 
affect comprehension. Single word articulation tests were chosen to 
provide the severest type of test of TASI clipping. 

These tests were conducted with a list of one hundred “‘phonetically 
balanced’’® words, which have been so chosen that all speech sounds are 
represented according to their frequency of occurrence in normal speech. 
The list of words was recorded using four different voices which were 
combined so that twenty-five words from the list were recorded by each 
voice. Two male and two female voices were used. 

The recorded lists of words were played through a TASI terminal 
which was artificially loaded to insure the desired amount of TASI 
clipping. The clipped speech was played back in the laboratory through 
standard telephone handsets. Six technical and nontechnical observers 
were used for each test. The speech levels at the receivers and the re- 
ceiver noise were adjusted to simulate an average transatlantic connec- 
tion. The observers wrote down the words as they heard them and their 
responses were compared with the original lists to determine the pho- 
netically balanced word articulation in per cent. 

The approximate impairment in db corresponding to the per cent PB 
word articulation can be obtained from previous information that relates 
phonetically balanced word articulation to syllable articulation and 
syllable articulation to volume above threshold. By this means the 
articulation results for the conditions tested are related to the corre- 
sponding volumes, and the variation in these volumes from the reference 
condition yields the impairment in db. 
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Comment on “Discrimination against Unwanted 
Orders in the Fabry-Perot Resonator’ 


(Manuscript received May 18, 1962) 


In the above paper,! Kleinman and Kisliuk state that “Tox and Li 
have investigated these configurations and the corresponding frequencies 
and losses for interferometers consisting of perfectly reflecting plates in 
air. In the usual laboratory interferometer the Fox and Li modes cannot 
be resolved because of insufficient reflectivity of the plates. Therefore 
the role played by these modes in optical masers is not settled.” 

Unfortunately these statements might be interpreted to mean that 
there is doubt as to the validity of the normal mode concept applied to 
maser interferometers. 

We should like to correct the impression that the analysis of Fox and 
Li’ was limited to perfectly reflecting mirrors. As a matter of fact, the re- 
flectivity of the mirrors is completely unimportant in determining the 
normal modes, providing only that it is uniform over the mirrors. 

It is quite true that in most solid state masers the inhomogeneities of 
the medium appear to create so much chaos in the radiation fields that 
correlation with a simple theoretical picture is often hard to demonstrate. 
However, gas masers appear to behave in a reasonably ideal way, and 
both the near-field and far-field radiation patterns for these masers 
appear to confirm the normal mode picture. In the case of such a maser 
equipped with plane mirrors, Herriott* has observed 1.8 me beats which 
correspond well with the expected difference frequency between the 
dominant (even-symmetric) mode and the lowest order odd-symmetric 
mode. 

An even more striking confirmation is seen in Herriott’s pictures® of 
the light distribution across the plane mirrors of his helium-neon maser. 
These show fairly symmetrical multi-lobed distributions, which are at 
least qualitatively what one would expect for low-order transverse 
modes. 

Finally the very beautiful pictures of Kogelnik and Rigrod* have 
demonstrated convincingly the existence of higher order modes in a 
helium-neon maser with concave mirrors. 

With regard to passive interferometers, Ii. H. Scheibe has reported® 
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that in 1955 a “spurious” resonance was observed in a parallel plate 
resonator at 9.4 kmc. This turns out to have been the lowest order inter- 
ferometer mode. Scheibe states that his value of measured Q agrees well 
with the loss curves of Fox and Li and with a curve given by Goubau 
and Christian. Christian and Goubau® have given a number of measured 
values for diffraction loss in a parallel-plate resonator over a range of 
values in N and have shown that these all agree closely with the theo- 
retical loss curve given by Fox and Li for the dominant mode. Good 
evidence for higher order modes exists in a report by Culshaw’ on a 
millimeter wave interferometer in which small subsidiary resonances (Tig. 
6 of Culshaw) appeared at slightly greater reflector separations than the 
main resonances. The observed separations agree within a few per cent 
with what would be predicted from the results of Fox and Li for a TEM); 
mode. 

These findings leave very little doubt that the iterative normal mode 
picture does apply to laboratory interferometers, either with loss or 
with gain. 


A. G FOX 
TINGYE LI 

D. A. KLEINMAN 
P. P. KISLIUK 
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